Gene Acid345_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3840 
Symbol 
ID4070991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4543883 
End bp4547251 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content58% 
IMG OID637985863 
Producthypothetical protein 
Protein accessionYP_592914 
Protein GI94970866 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGA TGAGGCAAGC GATAGCGAGG GCGCGCATAG GCAGTCCTTG GAAGAGTGTG 
AGCACGATCT TGATCATGCT CTGCCTGACG GTCGGAGCGA TGGCACAGAG GATCACCGGC
ACATTGCGCG GTCAAGTAAC CGATTCCGCC GGCAGCGTGG TGGTTGGTGC GAAGGTCACC
GCCGCAAACC AAGACAGCGG GGTCACCGAG AAAACGGCAA CCAACTCGGC CGGCACCTAC
ATATTTCCGG AACTGTTGCC GGGACCGTAT ACCGTGACGG TGCAATCGGA GGGCTTCGCC
ACGAGCGCGG TCCGCGACGT GCGGGTCGCG ACCAATGTGG TCAACGACCG GAACGTCTCA
TTGGCAGTCG GCGGGTCGAC GACCACCATT GATGTAAACG CGGCTGCCGA GACGGTGGAT
CTCTCTTCGT CGACGGTTGC CACTACGTTT GATACGCGAG AAACTCTCGA TATTCCCAGC
GGATCCAACA GTCCGCTGCA ATTGGCATTG TTTTCGGCGA ACACTACGGC GCAACAAGGC
GGCGTAACCG GCACTGGCGG ATCCGTTTCC GGTACACGTC CACGGTCCAA CTCCTTCAAC
ATTGACGGCG TGGACGATAA CAACGCGGGC ACTTCCGGCC AGATCTCGAA CGTCATTCAG
GATGCAGTCG CCGAATTCAA CCTGGTGACG AATCCGTTTT CCGCGGAGTA CGGGCACGCG
GGCGGCGGGC AATTCAACAT CGTCACCAAG ACAGGCACCA ACAGTTGGCA CGGGTCAGGG
GAGTACTACC TGCAGAACCG CTTCCTGAAT GCGCTCGACA ATTTGACGAA AGATGCAATC
GCGCAAGGCG CCATCGACCA CACTCCCAGG CTTGACGTCA GCCGAGTAGG TGGGACGATA
GGTGGCCCGA TCATCAAGAA CCGCTGGTTC ATCTTCGGCG CCTATGAATA CTTCGACCAG
CGCGCGGACA GCCTCGGAGC CGACATAGAA ACGCCTACGG CAGCAGGCAT CAGCACTTTG
CAGTCGCTGG CGGCAACTCC GTACATTGCG AACCTGATTG GCACGCTGCC TGCGGCGCAA
ACCGCAAACT CCGCGCCGCT TCTGGTGAAC GGCGTCAGCA TTCCCACCGG CTTGGTTCCC
CAGGTGGCGC CCAATCCGTT TAAAGAACAT GATTTTCAGA TCAATAGCGA CCTAAAAGAA
GGTCGCCATG AACTGAGCGC GCGCTTCCTG TTCAACAAGC AAGACATCAT AACCGCCGGT
GCGATCAGTA CGCCTGAATA CAATCTGCCT ACCACCCTCA CAAATTACAA AGCGGCCTTG
ATTGACACCT GGTCGATCTC CAACACGCTG GTCAACGACC TGCGGATCTC GTACTCCCAT
TCTCTGCAGA GCCTCGCGGT TCCAGAGCCC TTCAGCAACA ACCCACTCAT TTTCCTGGCC
GACATGAATG GCATCACCTT CGGGGCCAAT GATCCGCAGA GGATCTTGCA GGATGTGTAC
CAGGTCATCG ATACGCAAAC CAAGATTTTC GGCCGGCACA CATTGAAGTA TGGCGGTGAA
TATCGTCATT ACATCGCGCC GGCATTTTTC CTGCAAAGAT CCAACGGGAA CTACTTCTAT
CTTTCGACAC AGACGTTTAT CAACGACGGC GTGCCGGAGA TCCAAATGCT CCGCGGCGCC
GGAGACCCGG TGTTTCCGGA AACGCAGAGT GCCGTCGGGG CATTCCTGCA GGATGACTTC
AAGGTGAGCA ATCGGCTGAC CCTGAACCTC GGTCTTCGAT ATGAGTTCAC CAACAATCCC
TCTGGCGCAG AACGTCAGGC GAAGAACGCA ATTTCAAATG TTCCCGGAGT GATCGACTTT
CACGCACCGA ACACCGCGAA GCTGGATTTC GAACCGCGAA TCGGATTTGC GTGGGACCCC
ACCGGAACCG GCAAGACCTC GGTTCGTGGC GGAATCGGAT TGGGATATTC GCCTCCGGTT
AACAACTTCA ACCAGAATGC CCAGCCGCCG CAGGTACAGA CGGTCCTCAA CCTGGGAACA
GCGTGCTTCG GCGGCCTGAC CACTGCTCCT GCATGGTGCG CTTCAGGAGA CCACTTCTTT
GCGCAGGGAG CACTGCCGTC AACCTATACG CTCGCGCCCG GTCCCGACAT ACCGCGGGCA
CTCACGGCTT CCATCATGCC AGACACGATT GATGCTCGCA TCGTGAACTG GAGCCTCGGC
ATTCAGCGCG AGGTTTACGC TGGTGGCGTT TTGGATGTGC GCTACGTTGG TTCGCGGAGT
TTCCATCTGC CGACCCAGAT CCGACTGAAC AGCATTAGTG CTTTCGATGC CGGTCTGACT
GCGTTGCCGA CCTACTTCAG CAACTCCGAA GTCCCGTCGG CCGTTCCGAA CCCTGCATCC
ACGCAAGCGG ATTTCAAAAC CTTCCTTGCA AACCAGGGGT TCGCGCCGTA TTCACAGTAC
GGATTCGTCA ACGTGCTGAC CGAGATCGGG CCGTTTGGTG CGAGCGTCTA CCACGGCGCT
TCTGTGTCGT TCACACAAAG TCTGCGCCAC GGCTTGACGA TGCGTGCGAA TTACACCTGG
TCGCATAACA TCGACAACGC AACGAACGAA CTGAACTCGA GCTCGGTGAA TCCGCGCCGT
GCTGAAGATA GCTACGACCT CGATGCGGAA CGGGGAAATT CAGTGCTGGA TGTACGGCAT
AAGTTCGCCG TAGCCTGGAC CTATCACACC CCGAACCTGA CAAGCGGTAG CCGTTTGCTG
AGAGCGCTGG CGAACGGGTA CGAAATCAAC GGCGACTTCA TCGCCCAGAC CGGTCAGCCG
GTTACGATTC TGTCTCCTTA CGATGCGAAT GCAAACGGCG ACACCGCTGG CGAGCGCGCG
ATCTTCAATC CCCACGGTTC CCAGAACCTC TCTACCGATG TGAACTTCGT TTGTAACGAC
GGAAGCGGTG GCGCGACCAG AATCGTGAAT CCGCAAGATG CCGCGGCAAC GCCGTGTTCG
CCTTCGAACG TGGTGGGCTA CGTCAGCATG AATTCAGGCG CTGCTTTCGT TGCCAGTCAA
CTTGGTGCGC GTTCGAATCT CGGCCGTGAC ACGGTGTACT CGCCGGGATT CGGAGTATGG
AACGCGTCGC TCGGGAAGAG TTTCCGGGTG ACGGAAGGGA AGTCGTTCCT GGCGCGCGTA
GAAGTTTACG ACGTATTCAA TCATCGCAAC TTTACGGTGG CCGGTCCGGT TACGGTGTTC
GGAACCGCAA CTGGACTCCA AGCGTTCAAC CTTGGCTATG TGCAAACGGG GAGCAGCCAG
TTCCTGGATT CCAAGCAGTT TACCGGCGGC GCCCGTGCTT TGCAGTTGGT CTTCAAGTTC
ATCTTCTGA
 
Protein sequence
MKQMRQAIAR ARIGSPWKSV STILIMLCLT VGAMAQRITG TLRGQVTDSA GSVVVGAKVT 
AANQDSGVTE KTATNSAGTY IFPELLPGPY TVTVQSEGFA TSAVRDVRVA TNVVNDRNVS
LAVGGSTTTI DVNAAAETVD LSSSTVATTF DTRETLDIPS GSNSPLQLAL FSANTTAQQG
GVTGTGGSVS GTRPRSNSFN IDGVDDNNAG TSGQISNVIQ DAVAEFNLVT NPFSAEYGHA
GGGQFNIVTK TGTNSWHGSG EYYLQNRFLN ALDNLTKDAI AQGAIDHTPR LDVSRVGGTI
GGPIIKNRWF IFGAYEYFDQ RADSLGADIE TPTAAGISTL QSLAATPYIA NLIGTLPAAQ
TANSAPLLVN GVSIPTGLVP QVAPNPFKEH DFQINSDLKE GRHELSARFL FNKQDIITAG
AISTPEYNLP TTLTNYKAAL IDTWSISNTL VNDLRISYSH SLQSLAVPEP FSNNPLIFLA
DMNGITFGAN DPQRILQDVY QVIDTQTKIF GRHTLKYGGE YRHYIAPAFF LQRSNGNYFY
LSTQTFINDG VPEIQMLRGA GDPVFPETQS AVGAFLQDDF KVSNRLTLNL GLRYEFTNNP
SGAERQAKNA ISNVPGVIDF HAPNTAKLDF EPRIGFAWDP TGTGKTSVRG GIGLGYSPPV
NNFNQNAQPP QVQTVLNLGT ACFGGLTTAP AWCASGDHFF AQGALPSTYT LAPGPDIPRA
LTASIMPDTI DARIVNWSLG IQREVYAGGV LDVRYVGSRS FHLPTQIRLN SISAFDAGLT
ALPTYFSNSE VPSAVPNPAS TQADFKTFLA NQGFAPYSQY GFVNVLTEIG PFGASVYHGA
SVSFTQSLRH GLTMRANYTW SHNIDNATNE LNSSSVNPRR AEDSYDLDAE RGNSVLDVRH
KFAVAWTYHT PNLTSGSRLL RALANGYEIN GDFIAQTGQP VTILSPYDAN ANGDTAGERA
IFNPHGSQNL STDVNFVCND GSGGATRIVN PQDAAATPCS PSNVVGYVSM NSGAAFVASQ
LGARSNLGRD TVYSPGFGVW NASLGKSFRV TEGKSFLARV EVYDVFNHRN FTVAGPVTVF
GTATGLQAFN LGYVQTGSSQ FLDSKQFTGG ARALQLVFKF IF