Gene Acid345_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2071 
Symbol 
ID4069922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2482142 
End bp2484031 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content60% 
IMG OID637984086 
ProductTonB-dependent receptor 
Protein accessionYP_591146 
Protein GI94969098 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00162344 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000156466 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAGTT CGATTGTACT GTTATTTCTT TGCATTTCTT TAGGTGCTTT GGCGCAGCAA 
GGGCCTACCC CTCCGCAAAC GGCAGCAGGC GGACCTCCAC CCACGCAGCC GCAGCCCGAC
GAAGTGGTGG TCGTGACCGG AACCTGGGAG CCGATGCCGC TCGAAGATCT CCAACGCTCG
GTGCAGTCCG TTGATGTGCA GGCTGCTCCG CTCTTGTTCT CGAGCACTGC GCAGTTTCTG
CAACTCGATC CGTCGGTAGA TGTGCGGCAG CGCGCTCCGG GCGGTGACCA GGCAGACCTG
TCGATCCGGG GCTCGGCCTT CGAGCAATCG CTGGTGCTGA TTGACGGTCT CCGCGTGAAT
GACGCTCAGA CCGGCCACCA CAACCTCGAC CTCCCGATTC CGCTCGACAC TATCAGCCGC
ATCGAAGTCC TGCATGGCGC GGGTTCGACG TTCTATGGCG CCGATGCGCT GGGCGGCGCG
GTGAACTTTA TCACCGCTCC GGCTGCGACC AGCGAACTAC GCTTGCGTGC GGGTTTCGGC
AACTTCGGCT ACAACGAGCA GCGTGCCGTC GCTTCCCACG CGACGAAGAA CTTCAGCGAG
CAGCTCATCG GCGATCGCAG CTTCTCGACG GGCTTCATAG AAGACCGCGA CTTCCGGAAC
GCGGCCGTGT CGAGCGAGAC CCATTTCCAT ACCGCGCTCG GCGACACAAT GTTTCTCCTT
GCGACCTCCG ACCGACCCTA CGGAGCGAAC CAGTTTTACG GTCCGTTCGA TTCGTGGGAG
CGAACCAAGG CGTGGTTCGT GGCGTGGACA CAGGACCTCG GCAAGCAGAC GGCTTTCGAC
TTTGGTTACC GCCGCCACAC CGATGAGTTT GTGCTGCTCC GCGAGGCACC CAGCGTTTAT
GAGAACAACC ATGTGACCGA TAGCTGGCAG GGTGCCCTTC GCCGTCACGA CGAAATTGGC
AAGGTCACGA CGATTTCTTA TGGCGCGGAA GGCTATCGCG ATCAGATCGA CAGCAACAAT
CTCGGATATC ACGGCCGCAA TCGCGGCGCA GTGTATGCCG CCGCCGATTT CCGAATGATC
AAGCGCTTCT CGCTCTCCGT GGGCGCTCGC GAGGAGTCCT ACAACGGGAC CAAGGGACAG
TTCACACCGT CGGTGAGTGC GGCGTACTGG TTCGCGCCGT CATTCAAAGT AAGGGGCGCA
GTGAGCCGCG GCTTCCGTAT TCCAACCTAT ACCGATCTTT ATTACAGCGA TCCCGCCAAT
GCAGGAAACC CTAACCTTCG TCCGGAGTCG GCGTGGAGCT ACGAAGGCGG CGTCGATTGG
AATGCGGGCG GCAAGATAGC TCTGACGGCG ACAGTATTCC ACCGCCGCGA GCATGACGGC
ATTGACTACG TGAAGTGCGG CTCCGGCTTT ACCTTCGACA TCAATACCGG CACCTGCATC
GCAAGCGGAG TACCGAACGA CGTTTGGCAT GCCTACAACA TCGACAGCCT GAACTTCACC
GGCTTCGAGA CCCTTCTTCG CTATCGTCTT ACGCAGCGCC AGGAGTTCAC CGTGGGTTAT
ACCGGCATTC ACGGTTCGCA GAATGCCGCA CCCCGTGTGC AGTCGCAGTA CGTCTTCAAC
TATCCCGTGA ACAATACTTA CGTAGGATGG CAGGGAAGTG TGTGGCGAGG GATCATCGCG
CGGACGCGTC TCGGCGTGAC CCAACGCTAC GCGCACGATC CCTATGCCCT TTGGGACTTC
TCTGTAGCGA GGGAAGAGGG ACGTATTCGG CCCTACCTGC AGTTCACAAA TCTAACCAGC
ACGACCTATC AGGAAGTCGA TGGCGTCGCG ATGCCGGAGT TCGGCGTGAT CGGTGGCGTA
GAGATCGCGG TCTTCGGCAA GAAGCGTTAA
 
Protein sequence
MRSSIVLLFL CISLGALAQQ GPTPPQTAAG GPPPTQPQPD EVVVVTGTWE PMPLEDLQRS 
VQSVDVQAAP LLFSSTAQFL QLDPSVDVRQ RAPGGDQADL SIRGSAFEQS LVLIDGLRVN
DAQTGHHNLD LPIPLDTISR IEVLHGAGST FYGADALGGA VNFITAPAAT SELRLRAGFG
NFGYNEQRAV ASHATKNFSE QLIGDRSFST GFIEDRDFRN AAVSSETHFH TALGDTMFLL
ATSDRPYGAN QFYGPFDSWE RTKAWFVAWT QDLGKQTAFD FGYRRHTDEF VLLREAPSVY
ENNHVTDSWQ GALRRHDEIG KVTTISYGAE GYRDQIDSNN LGYHGRNRGA VYAAADFRMI
KRFSLSVGAR EESYNGTKGQ FTPSVSAAYW FAPSFKVRGA VSRGFRIPTY TDLYYSDPAN
AGNPNLRPES AWSYEGGVDW NAGGKIALTA TVFHRREHDG IDYVKCGSGF TFDINTGTCI
ASGVPNDVWH AYNIDSLNFT GFETLLRYRL TQRQEFTVGY TGIHGSQNAA PRVQSQYVFN
YPVNNTYVGW QGSVWRGIIA RTRLGVTQRY AHDPYALWDF SVAREEGRIR PYLQFTNLTS
TTYQEVDGVA MPEFGVIGGV EIAVFGKKR