Gene Acid345_4483 details

Gene Information

Locus tag: Acid345_4483
Symbol:
ID: 4070966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5321235
End bp: 5323448
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 59%
IMG OID: 637986522
Product: TonB-dependent receptor
Protein accession: YP_593557
Protein GI: 94971509
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
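
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5321235, 5323448)
print(length)                  # 2214
print(protein_length(length))  # 737
```

Both results agree with the Gene Length and Protein Length fields of this record.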


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000231942
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGCGCCGTC TGCTCTTATC TTGCTGCCTA CTTTTTCCGG CAGCGCTTTT CGCCGGTGAA 
CTCACTATCC ATGTGGTTGA CCCCGACCAG CGTCCGGTCG TGAATGCCAA CGTCGCGCTT
TATTCCAGCA ATCATTCCGA GGTCCGCACT ACCGGCGCTG ACGGTGCTGC GCATTTCTCA
AACGTGAGCG ATGGCAGTTA TAACGTGCAG GTGCTCGCGC CGGGGTTCGC CAAAGCCGAA
CAACCGGTCC AACTTCCTGA ACGCAGCGAT TCCACTGTTG CACTTTCTGT CGAAAACGCC
GCCGAGAGCG TGGTAGTAAC GGCTACGTCC AGCCCCGTAT CGGAAGCTGA GTCTGGCTCC
GCGGTTTCCA CTCTGGACGC CCAACAACTC ACGCTGAAGC AGGCTACCGC CGCCAGCGAT
GCACTGCGCT TTATGCCCGG CGTACTCGTT ACCGCCACCG GCCAGCGCGG ATCGTTGACG
ACGGTCAACG TGCGCGGTGG AGAGTCGCGC TATAACCACG TAATTGTGGA TGGCGTTCCG
GTGAACGAAC CCGGTGGCCA GTTCGACTTC GGCGTTGTGC CGACGGCTCA AATGGACCGC
ATCGAGGTGG TTCGCGGCTC CGACAGCGCG GTTTATGGCT CGGACGCGAT GAGCAGCGTG
GTGCAGATGT TCAGCGCCAC CGGAACCACG CGGACCCCTG AGTTTCGCTT CGGCGCCGAC
GGCGGCAACT TCGGCACGGC ACACGGATTT CTCTCCATTG CCGGCGCGTG GCGGCGTTTC
GATTACAACC TTTTCGGCGA TCAGTTCAAT ACCAACGGCC AGGGACTCAA CAACGCCTAT
TCCAACGGCC TCGAAGGTTT GAATCTTGGC TATCGCGTGA ATCAAAGGGC GCAGCTGCGC
TTCCGGCTGC GTCATGCGAA CAGCTGGAGC GGTACCTCGA ACGAGTGGTG GTTTAACGGG
GACGCTGCCC TACCGGCGGA TTCCGATCAA TACGCGCGAC AGACGAATTT TCTCGCCGAT
CTCGATCTCA CCATCGCGGG TCCCGGCGCA TGGCAGCACC GCTTCAGCGG CTTTGAGTAC
AACCACGACC GGCGCAATGT GGACAGTTTC GTTGATCCCG GCCGTCCAGC AGATTTCGAT
CAGCCCTTCG ATTCGGCGGC GCTTTACAAC CGCGCAGGCT TTGATTGGCA GTCGGACTAT
TCTCCACGCA GTTGGACGCG GACTTCGATC GGCTATCACT TCGAAAAAGA GAACGGGAAT
ATAACCAGCA ACTACTCGTT CTTCGGATTC CCTGAGTACA GCGTCACAAT TGGGCAGCGT
AACAACCAGG CTGTGTTTGG CCAGCAGATG CTGCTCTGGA AGCGCTTCAG CCTGCTCGCT
GGCCTGCGCT GGGAGCACAA CGAGAGCTTC GGCGATAAAG CGATTCCGCG CGCTGCCCTT
AGCTTTGTTG TGTTGCGCGG CGGCGAGATT TTCAGCGGCA CGCGTCTTCG CGGTTCTTAT
AGTACCGGCG TGGTAGAACC GAGTTTCGAA GAGACTTTCG GCATCTCGGG TACGTTTCCC
ACGCTGCCGA ATCCGGACCT GAAGCCCGAG CAGGCACGCT CGTTTGAGGC CGGATTGGAG
CAAGGTTTCC TGGCGAACAA GGTCTCGCTC TACGCGGCCT ACTACAACAG CATTTACCGC
GACCAGATCC AGTTCTACTT TGACCCAATC ACGTTCAACA GCCAATACCG CAACATCAAT
CGCGCCCTGG CACACGGTGC GGAAGTCGAC ATTCAAGCGC GCCTGAACAA GAGCCTCTCA
GTCAGCGCGA ACTACACGTA TACGTCGTCG CAGATCCTGA GTGCCCTGCC GTGCGATCCG
GCGGCAGGAT GCGATCCGCG TCTCTTTGGC GAAGGCAGCC CGCTGCTACA TCGACCACGG
CACTTCGGCA ACTTGATGCT GAGCTATTCG CGTAGCCGCT GGGGTGCGCA ACTCGCGGGT
GTGGCTGTGG GGCGTCGTGC CGACGACGAT TTTGGCCTTG CTCCCGCACC CATTTCCTAC
GCTGCGGGTT ATGCGCGCTT CGACGCTTCA GGCTATTACA CGGTGAGTAC GCACGTGACT
GCCTACGTGA ATATGGAGAA CCTGCTGAAC CACTACTACA ACGAAGTTGT CGGGTATCCG
TCGCTGGGCT TCAACTTCCG CGCGGGATTA CGCTTCCGCT TCGGTGGCGA ATAG
 
Protein sequence
MRRLLLSCCL LFPAALFAGE LTIHVVDPDQ RPVVNANVAL YSSNHSEVRT TGADGAAHFS 
NVSDGSYNVQ VLAPGFAKAE QPVQLPERSD STVALSVENA AESVVVTATS SPVSEAESGS
AVSTLDAQQL TLKQATAASD ALRFMPGVLV TATGQRGSLT TVNVRGGESR YNHVIVDGVP
VNEPGGQFDF GVVPTAQMDR IEVVRGSDSA VYGSDAMSSV VQMFSATGTT RTPEFRFGAD
GGNFGTAHGF LSIAGAWRRF DYNLFGDQFN TNGQGLNNAY SNGLEGLNLG YRVNQRAQLR
FRLRHANSWS GTSNEWWFNG DAALPADSDQ YARQTNFLAD LDLTIAGPGA WQHRFSGFEY
NHDRRNVDSF VDPGRPADFD QPFDSAALYN RAGFDWQSDY SPRSWTRTSI GYHFEKENGN
ITSNYSFFGF PEYSVTIGQR NNQAVFGQQM LLWKRFSLLA GLRWEHNESF GDKAIPRAAL
SFVVLRGGEI FSGTRLRGSY STGVVEPSFE ETFGISGTFP TLPNPDLKPE QARSFEAGLE
QGFLANKVSL YAAYYNSIYR DQIQFYFDPI TFNSQYRNIN RALAHGAEVD IQARLNKSLS
VSANYTYTSS QILSALPCDP AAGCDPRLFG EGSPLLHRPR HFGNLMLSYS RSRWGAQLAG
VAVGRRADDD FGLAPAPISY AAGYARFDAS GYYTVSTHVT AYVNMENLLN HYYNEVVGYP
SLGFNFRAGL RFRFGGE
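
The relationship between the two sequences can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the amino-acid assignments and the start-codon set of NCBI translation table 11 (the table named in this record), and `translate_cds` is a hypothetical helper name. It shows why a gene beginning with TTG yields a protein beginning with M: an alternative start codon in the first position is read as methionine.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace (e.g. 10-base blocks) is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGCGCCGTC TGCTCTTATC TTGCTGCCTA"))  # MRRLLLSCCL
```

The printed residues match the first ten residues of the protein sequence in this record.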