Gene Acid345_4093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4093 
Symbol 
ID4072515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4852941 
End bp4854425 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content62% 
IMG OID637986124 
Productintegrin-like protein 
Protein accessionYP_593167 
Protein GI94971119 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.588322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCA TCACTCGTCT GCTGACACTC GTCTGCAGCA CAGCGTTGAC CGTTGCTGCC 
GGGGCGCAAG CCGCCAATAG TTACAACGGA ACATCGCCGA TCACCTACAA CTACACCACC
CTGACCACGT CCGGCGCCCA GGGTTACTCG GTCGCCTCGT CCGACTTCAA CCGCGACGGC
AACCCTGACC TTGTCGGCGG CACCGAGAAC GCCGTGGACG TATGGCTCGC CACCGGCCGT
GGCACCTACG TGAATTCACC TGTGTCCTAC GCGCTGCCGT TTTCGCCTAC GCACATCGAA
ACGCCCGACT TGAACAACGA CGGCTGGCCG GACATCGTGA CCGCGATTGC CAATGAAGCC
GGCGTAACTG ATGGCGAAAC CGCGGTGTTG CTAAACAACG GCAACGGCAC CTTTCGCATG
GGGACGACGA TTCCGAAGGT CACCGGCCAG CCCATCTGGG TATCAGCCGG CGATCTCAAT
AACGACGGCA ACATCGACCT GGTAGTGGAA GAGCGCATGT TCAATAACGG CGTCCAAACC
GACCAGTTCA TTGTGTACAT GGGCCACGGG AACGGCACCT TCACCAAGGG CCAGGTACTG
AACATGTCAA AGCCCACTAG CCCGCCCGTG CTCGCCGACC TGAACGGTGA CGGCAAACTC
GACATCGTGA ATGCCGAGGG CACCAAGGCG CTGATCTGGC CGGGCAAGGG CGACGGCACC
TTCGGAACGC CGATGAGCCT CCTCCCTCCG AGCGGCGCGG CTTTCAATGA CGTGACCACC
GGCGACTTCA ACAACGATGG CATTCTCGAC CTGGCGCTGG TGTGGTCCAA CGTCTGCGGT
GACGCCTGCG GCGGGCCGAA TAACAACCGT CTGTACATTT ACAAGAACAA CGGCAAGGCC
CAGTTTACGC TGGTCTCCGG GACCAACTTT GGCGGATGCA GCGCCGCCTA CCCGGTCGCG
GCTGACATCA ACGGCGATGG CAACATCGAC ATCAACCTCG TGGGGCCCAG CCATTTCTGC
GGCTTTTCCG AGGTGGCGTT CGGCAATGGC AAGGGGGGCT TCAGCGCGCT GATGAGCGGG
CCTTCCGGTG ACGTAACCTC GGATATGTTC TACCGCGATC TCAACCTCGA CTCTCGGCAC
GACGTAGCGC TCAGTGACAC CATCGGCGGT GATGTTGTAT CAGGCTTGGC GACCAATGGC
TACACCAACT GCGCGCCGCC GACAGCGGCG AACCCTGCGG CGAAGATCTG CTCGCCAACC
GGCAGCTCGT GGCCGGGCAC GTTTACCCTG CGCGCCAGCG GCAATTCACC GTCGGGGATC
GTGCGCATGG AGGTGTGGAT CGACGGCGTG AAGAAGTACC AGAAGTGGAA CGACCAGCTC
GGGAAAAGCT TCACGCTTTC CGCCGGACAG CACCGCATTA CCGTAGTCGC GGTGGACAAG
TACAAGGGCG ACGGCCGCAC CACGGCGATC GTCAACGTGC AGTAG
 
Protein sequence
MRIITRLLTL VCSTALTVAA GAQAANSYNG TSPITYNYTT LTTSGAQGYS VASSDFNRDG 
NPDLVGGTEN AVDVWLATGR GTYVNSPVSY ALPFSPTHIE TPDLNNDGWP DIVTAIANEA
GVTDGETAVL LNNGNGTFRM GTTIPKVTGQ PIWVSAGDLN NDGNIDLVVE ERMFNNGVQT
DQFIVYMGHG NGTFTKGQVL NMSKPTSPPV LADLNGDGKL DIVNAEGTKA LIWPGKGDGT
FGTPMSLLPP SGAAFNDVTT GDFNNDGILD LALVWSNVCG DACGGPNNNR LYIYKNNGKA
QFTLVSGTNF GGCSAAYPVA ADINGDGNID INLVGPSHFC GFSEVAFGNG KGGFSALMSG
PSGDVTSDMF YRDLNLDSRH DVALSDTIGG DVVSGLATNG YTNCAPPTAA NPAAKICSPT
GSSWPGTFTL RASGNSPSGI VRMEVWIDGV KKYQKWNDQL GKSFTLSAGQ HRITVVAVDK
YKGDGRTTAI VNVQ