Gene Acid345_1494 details

Gene Information

Locus tag: Acid345_1494
Symbol:
ID: 4071664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1816724
End bp: 1818472
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 64%
IMG OID: 637983503
Product: putative GAF sensor protein
Protein accession: YP_590570
Protein GI: 94968522
COG category: [S] Function unknown
COG ID: [COG5373] Predicted membrane protein
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
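
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (an assumption)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length = gene_length(1816724, 1818472)
print(length, protein_length(length))  # 1749 582
```

The result matches the reported 1749 bp gene length and 582 aa protein length.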


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.980694
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGATT CTGTAACAAA GGACTCGGAA TTCGAGATCC AGTATCCGCC CTCAATACTC 
GAAGACGTCA CAAAACTCGT TCAACGGGTT CAGACGTTTA CGGGAGCGGC CGGCGCAGCG
ATCGCGCTGC GCGAGGGGGA GGACATGGTT TGCCGCGGCA GCCGCGGCAA CAACGCCCCC
GACGTTGGAA TGGTTCTAAG CACCGACGGA ACGTTCACTG GCCTGGCCGT TACCGGAATG
AAAGCGGTGC GTTGCGACGA CACCGAAAAT GATTCGCGCG TCGACCCCGA AATCAGCCGC
GCCCTCCGCA TCAAGTCGAT GGCCGTAGTG CCCGTACTGA GCGGAATGCG CGTCAGTGGC
GTGATTGCGA CGTTCTCGAG CGCTGCCAGT GCCTTCAGCG ACACTCATAT GGCCGTCCTC
AAGACGATGG CCGATGGCTT GGGCGCTTCG ATCCAGCGCT TCCTGGAAGT CCAGGGAATT
AGCACCGGCG CACCGATGGT CTCCGCCGCG GCAGCAGCCG CACCGGCCCC GGTCGCCAGG
CCGCAGGTTG CACCGCCACC TCCGCCGCCT CCGGCGCCCA AGGTGGAAGC ACCGCGTCCG
GCACCGCCGC CACCACCGGC TCCGCCGAAA ATTGAAGCTC CGCGCCCAAC ACCGCCGCCT
CCGCCGGCAC CGAAGATTGA AGCTCCGAAG CTGGCCCCGC CGGCGCCTGC TCCGATGCCA
GTTGCCGCCG CTCCAGCGCC TGCAGTCGAG CGCGCTCCTG AGCCGCCGAA GCCACCGCCG
CAGCCACCGA AGCGGCAGGA ACAGAAACAG CAGGGCAAAT GGAAACCGGT AGCGCCTCCG
AAGCAGGAGG AAGAGGCTCC GGTGATCGAG AAGCCCGCGC CGAAGCCGGA ACCCAAGGCG
CAACCGAAGC CGGAACCGAA ACCTGAGCCC AAACCGCAGC CGAAGGCCGA GCCGGTCATC
GCTGCCCCCT CGTTCAGCTA CGAAGCGAAA ACCGAGGAAG GCGAAGGCGG CGGAAACAAG
GGCATGATTT TCGGCGGCGT CGCAGCCGCA GTGCTGGTAA TCGCGATCGG CGGATACTTC
ATGATGGGCA AGAAGTCGAG CCCAGCGCCT GCTCCGCCCA CTACTACGAC GCAACCGGCG
CCTGAGAACA CCGCGAACAC AACGCCCGCC TCCAACGTAA CTGGAACGGT GACGACTGGA
GCAAACCCCG CGGCCGCAAA CAACACCAAG CCGCAGGACC AGGGCAAGAA CAACAACAAC
ACGACCGCCA GCAAGCCGGA AGAGCAGCAG CAGGCGAAGC CGGCAGCCGC TCCGCTGGTT
GTAGGCTCCG CACCTTCGGC GAGCAACAAG CCGCAGCAGG TTGCAGATGT CTCCGCGCCC
TCGCTGAACC TGGCAGGCGC AGCCGGTGCG GGTCCGAATC TTGACGTTCC GGTAACCAGC
TCAGCGCCGA AGCTTTCCGC TCCTGCGCCG GCCAATGCCG TAATCGTTCC AAGCCGCCTG
GTCCAGCGCG TGAACCCGAA CTATCCGCAG TCCGCCAAGC AGTACCGCAT TGAAGGCGCG
GTAACGCTGA GCGCGACCAT CGGATCTGAC GGACACGTGA AGGACGCGAA AGTACTGAAC
GGGCCGCCGA TGTTGCGCGA CTCCGCACTC AACGCAGTTC GCCAATGGAA ATATGCTCCT
TCCACGGTGA ACGGGCGTCC AGTCGAATCG AGCGTACAGA TCGTGCTCCA GTTCAAGATG
CCTAGCTAA
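
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the space-separated 10-base blocks used here. The function name `gc_content` is illustrative, and the short input is a toy example rather than the full gene.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input: 4 of 8 bases are G or C.
print(gc_content("ATGC CGTA"))  # 0.5
```

Applying the same function to the full sequence and rounding to a whole percentage would give a value to compare with the reported figure.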
 
Protein sequence
MPDSVTKDSE FEIQYPPSIL EDVTKLVQRV QTFTGAAGAA IALREGEDMV CRGSRGNNAP 
DVGMVLSTDG TFTGLAVTGM KAVRCDDTEN DSRVDPEISR ALRIKSMAVV PVLSGMRVSG
VIATFSSAAS AFSDTHMAVL KTMADGLGAS IQRFLEVQGI STGAPMVSAA AAAAPAPVAR
PQVAPPPPPP PAPKVEAPRP APPPPPAPPK IEAPRPTPPP PPAPKIEAPK LAPPAPAPMP
VAAAPAPAVE RAPEPPKPPP QPPKRQEQKQ QGKWKPVAPP KQEEEAPVIE KPAPKPEPKA
QPKPEPKPEP KPQPKAEPVI AAPSFSYEAK TEEGEGGGNK GMIFGGVAAA VLVIAIGGYF
MMGKKSSPAP APPTTTTQPA PENTANTTPA SNVTGTVTTG ANPAAANNTK PQDQGKNNNN
TTASKPEEQQ QAKPAAAPLV VGSAPSASNK PQQVADVSAP SLNLAGAAGA GPNLDVPVTS
SAPKLSAPAP ANAVIVPSRL VQRVNPNYPQ SAKQYRIEGA VTLSATIGSD GHVKDAKVLN
GPPMLRDSAL NAVRQWKYAP STVNGRPVES SVQIVLQFKM PS
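
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the amino acid assignments of table 11 (identical to the standard code; the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids in the conventional T, C, A, G codon order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTGATT CTGTAACAAA GGACTCGGAA"))  # MPDSVTKDSE
```

The output matches the first ten residues of the protein sequence, and the final codons CCT AGC TAA translate to the closing residues PS followed by the stop.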