Gene Acid345_4555 details


Gene Information

Locus tag: Acid345_4555
Symbol:
ID: 4071500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5400219
End bp: 5401328
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 62%
IMG OID: 637986595
Product: TonB-like protein
Protein accession: YP_593629
Protein GI: 94971581
COG category:
COG ID:
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
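
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both are assumptions, though they agree with the values listed here. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 5400219
end_bp = 5401328

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values match the Gene Length (1110 bp) and Protein Length (369 aa) fields.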


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.248985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0574394
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATAC TCCAGATCAC GCCTTCCCGA GAAGAAGAAC AGGACAAGCA CTTCGAGGAT 
GTTGCACGCG CTAACGAAGG TAAAGCCAAC TTCGTGGAAG AGGCGTTCAC TCCCGTTGTG
CTCATGGACT TGCGCGACGA GCTCACTCGC TCGCGCCTCC GGGAAGCCGC CTGGATCTCG
ATCATCGCCC ACCTCGTCGC GATCATCTTT CTTAGTCTTA GCCCCAAGTG GATGCCCAAT
CTCTGGGGAC ATCCCGTGAA GGTCGTGGAA GACCGGTTGC GCGACAAGGA CACCACGTTC
CTCGCGCTTC CTCCCGACGC GCAAAAGCTG GTACAGAAGC CACACACCAA CGTTCTCTCC
GATAAAGACC GCGTCGCGAC TTCGCACAAT CCTGATCCGA AAGAGTTGAA GAAACTCCTC
GATCAGCGGC AGCCAGGCCC ACCCGCGCAT CCAGCAGCGC AGCCCAGCGT TCCCGCGCCG
CCGCAAATGG CCCAACAACA GCAGCAGTCG CCGCAGCAGC AAAACCCTGC TACGCAGCAG
GGACAGCAAA CGGCGATGAA CAATCCGCCG CAGTTCGAGA GCCCAAACAT GCAGCCCAAG
ATGACGCTGC CCAAGGCGCA GCCCAGCTTC GGCGCCGTCG CGATGTCGGC CGGCTCAGCG
ATCCAGCAGG CGGCGCGCGC ATCCTCCGGA TCAGCCGGTA AGCTCGCCGT TGGCGGTGGT
ATGGGACTCG GCCGCGGCCC CACCGGCGGG CAAGTTCGCG ACGCGATGGA GATCACGACC
GATACCCAGG GCGTGGACTT CGGCCCCTAT CTCGCACGCA TCAAGCAGAC CATCGAAGCC
AACTGGTACA CCGCAATGCC GGAATCGGTT TATCCGCCAC TGCGCAAGAG CGGCAAGGTC
GCCGTCGAAT TCGTAATTCT CCCCGACGGC AAAGTACAGG GCATGCGCAT CTTCTTCCCG
TCAGGCGACG TCGCACTCGA TCGCGCGGCG TGGGGCGGCA TCTCAGCCTC GAATCCATTC
CCGCCACTGC CCAAAGAATT CCACGGACCG TACCTCGGCC TCCGCTGCTA CTTCCTCTAC
AACCCGACGA CAAAAGACCT CGAGCAATAG
 
Protein sequence
MAILQITPSR EEEQDKHFED VARANEGKAN FVEEAFTPVV LMDLRDELTR SRLREAAWIS 
IIAHLVAIIF LSLSPKWMPN LWGHPVKVVE DRLRDKDTTF LALPPDAQKL VQKPHTNVLS
DKDRVATSHN PDPKELKKLL DQRQPGPPAH PAAQPSVPAP PQMAQQQQQS PQQQNPATQQ
GQQTAMNNPP QFESPNMQPK MTLPKAQPSF GAVAMSAGSA IQQAARASSG SAGKLAVGGG
MGLGRGPTGG QVRDAMEITT DTQGVDFGPY LARIKQTIEA NWYTAMPESV YPPLRKSGKV
AVEFVILPDG KVQGMRIFFP SGDVALDRAA WGGISASNPF PPLPKEFHGP YLGLRCYFLY
NPTTKDLEQ
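
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence, and its GC fraction compared with the GC content field. The sketch below is a minimal illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted). The names `GENE_SEQ`, `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Gene sequence copied from the record; whitespace is removed by clean().
GENE_SEQ = """
ATGGCCATAC TCCAGATCAC GCCTTCCCGA GAAGAAGAAC AGGACAAGCA CTTCGAGGAT
GTTGCACGCG CTAACGAAGG TAAAGCCAAC TTCGTGGAAG AGGCGTTCAC TCCCGTTGTG
CTCATGGACT TGCGCGACGA GCTCACTCGC TCGCGCCTCC GGGAAGCCGC CTGGATCTCG
ATCATCGCCC ACCTCGTCGC GATCATCTTT CTTAGTCTTA GCCCCAAGTG GATGCCCAAT
CTCTGGGGAC ATCCCGTGAA GGTCGTGGAA GACCGGTTGC GCGACAAGGA CACCACGTTC
CTCGCGCTTC CTCCCGACGC GCAAAAGCTG GTACAGAAGC CACACACCAA CGTTCTCTCC
GATAAAGACC GCGTCGCGAC TTCGCACAAT CCTGATCCGA AAGAGTTGAA GAAACTCCTC
GATCAGCGGC AGCCAGGCCC ACCCGCGCAT CCAGCAGCGC AGCCCAGCGT TCCCGCGCCG
CCGCAAATGG CCCAACAACA GCAGCAGTCG CCGCAGCAGC AAAACCCTGC TACGCAGCAG
GGACAGCAAA CGGCGATGAA CAATCCGCCG CAGTTCGAGA GCCCAAACAT GCAGCCCAAG
ATGACGCTGC CCAAGGCGCA GCCCAGCTTC GGCGCCGTCG CGATGTCGGC CGGCTCAGCG
ATCCAGCAGG CGGCGCGCGC ATCCTCCGGA TCAGCCGGTA AGCTCGCCGT TGGCGGTGGT
ATGGGACTCG GCCGCGGCCC CACCGGCGGG CAAGTTCGCG ACGCGATGGA GATCACGACC
GATACCCAGG GCGTGGACTT CGGCCCCTAT CTCGCACGCA TCAAGCAGAC CATCGAAGCC
AACTGGTACA CCGCAATGCC GGAATCGGTT TATCCGCCAC TGCGCAAGAG CGGCAAGGTC
GCCGTCGAAT TCGTAATTCT CCCCGACGGC AAAGTACAGG GCATGCGCAT CTTCTTCCCG
TCAGGCGACG TCGCACTCGA TCGCGCGGCG TGGGGCGGCA TCTCAGCCTC GAATCCATTC
CCGCCACTGC CCAAAGAATT CCACGGACCG TACCTCGGCC TCCGCTGCTA CTTCCTCTAC
AACCCGACGA CAAAAGACCT CGAGCAATAG
"""

# Codon table in the conventional NCBI ordering (bases T, C, A, G; third base
# varies fastest). Table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; stop codons become '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


gene = clean(GENE_SEQ)
protein = translate(gene).rstrip("*")  # drop the terminal stop codon

print(len(gene), "bp;", len(protein), "aa")
print(f"GC fraction: {gc_fraction(gene):.1%}")
print(protein[:10], "...", protein[-9:])
```

The translated product can then be compared against the Protein sequence block, and the printed GC fraction against the GC content field.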