Gene Acid345_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1678 
Symbol 
ID4069346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2029530 
End bp2032490 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content58% 
IMG OID637983686 
Producthypothetical protein 
Protein accessionYP_590753 
Protein GI94968705 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.675955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAAAA TTCAGGTGCT GGTTTTCGCG CTGCTATGCG CGCTTGCGTC GTTTGGTCAA 
GAATTGGGCA CTGCTACCCT GACGGGGACG GTTGTCGATC CGTCGGGAGC AGTGATTGGG
GGCGCAAAGG TCAGCGCGCA GAACAAGGCC ACTGGGATGC AACGAGAGGC TCTGAGCACG
AAGGCAGGGA TTTTCGTGTT TAACGATCTT GCTCCGGGTG AGTATGACCT GACGATAAAG
GCGGATGGAT TTGCCGTGAC CACCGCGACA GTGCGCATAA CTGTAGGTCA ACAGGCGAAC
CTGCGCGCGG AAGTGCGGGT GGCGCAGTCG GAACAACATA TTGACGTTTC GGGCGCGATT
CCGCTGGTGG AGACGAGCTC GTCCGTGGTG GACGGCGTGG TGGATTCGAA GCAGATTGAT
GCCCTCCCCC TGAATGGCCG TAACTTCCTG GAACTCGCAT TGCTGATGCC GGGGAATGCG
CCGGCGCCGC TGTATGACCC GACAAAGTCA GACACCGTTT TGGTTTCTTC GACGGGACAG
CTTGGACGCG GTCAGAACAT CACGATTGAT GCGTCAGACA ACAACGATGA TGTGGTGGGC
GGAATGCTGG TGAACATTCC GCAGGATGCA GTGCAGGAAT TTCAGATTGC GACGAACCGG
TTCTCGGCGG AATTGGGGCG ATCGACTTCG TCGGTGGTGA ACATCCTTAC GAAATCGGGC
ACCAACGATC TGCACGGGAC CGCAGCGATC TTTGCACGCG ATGCGGCACT GCAGGCAAAA
CCCAATTCGT TTGGACAGAG TGTTGGAGAT ACGCCGCCAT TCAGGCGCGA TCAGTACACC
GGCTCCATCG GCGGACCTGT GGTGAAGGAC AAGGCGTGGT GGTTTACGTC TTTCGAATAT
CGCGACCAGA TGGGCGGCAT TTTGGTGGGC GAGCGCGATC TGGCAACGCA GTCTATTCAC
ACGACCTTTG CTGCTGCGCC GCTGACGGAT GCGTTGGGGA CAGCGAAGTT TGACTGGGCG
ATTTCGTCGA AGGACACGCT CTCGACGCGC TACTCCACCG AGAGCTTCGA AGGGACCTCC
AATAGCGCGA CCGACCGCGC GCTCGGCACG GCGTCGCAGA CGCAGGCTTC GACGAACCGC
TTCCACGACA TCAACACGAA CTGGACGCGC GTGATTTCGG CTGCATTGGT GAATCGCGCG
CAATTCGCCG TGAACCTCTT CCAGAACGAC ACGGTGGCGA ACGGTACCGG GCCGCAGATT
AATTTCCCGA GTATCGAGGC GGGTTCGTCG TATCGCGTGC CGCAGGCGAC GCACCAGCAA
CGCTTGTTGT GGGGGGACAC GCTGGATTGG ACCCACGGCA AGCACAATCT TAAGTTTGGT
GCGCAGATGC AGCGCGTGGG ATCCGACTTT AACCTTGGCG TCTTCCAGCA GGGAATTGTG
AACGCCGTGG AAGATTTTCC AGACTTCGAT CGCAATGGCG ATGGAGTGGT GAACGACAAC
GATCTGCTGT TTGCAGTAGG GCTGGTCAGC CATACGCCGA CGCGCCCGCT GATTATTCCT
GATGCCGACA ACAACTATGT TGCGTTGTTC GCGCAGGACG ACTGGCGGGT GCATCCGCAG
TTGACGTTGA ACGTCGGTCT GCGATGGGAG CTCGATACCG ACGTGAAGAA CGTTGGACAT
TATGACGAGA TCAATCCGCT GGCAAAGCCG TTCCTGCACG GCGATCGTTC GGCGGACTAT
ACGAACTTCG GGCCACGCAT TGGTTTCAAC TGGGCCAACA AGCCGGGGAC GTTGAGCGTG
CATGGCGGAT ACGGGATGTA CTACGACCGC ATCGTGCTGG AAATTGTGTC GTTGGAGCGC
GGCGAAGATG GTCGCGCGCT GGCGATTGAT GTGCATGCAG GAAATGTCTT TCCGGGATAT
ATGAATCCGG ATGGGACCTT CATCCCGGGT GTGACGCCGA CACTCGCGGA TCCGTTTACG
GGATTCATTC TTCCCGGCGC GGGCGCGGGT GGAATTTACG CCATCAGCAA CCAGATGCAG
AACCCGATGG TGCAGCAATT CAATCTCGGA GTGCAGTGGG AGTTCCTGAA GAACTGGGTT
GTACGTGCCG ATGGAATGCA CGACTTTGGA CAACACTTCA TCATCGGCGT GCCAGTGGGC
ACGGTTTACA ACCCTGTGGT TGGTGGACCC GACACGGTGA AGATTCTTGA GTCGGCCGTG
AACACGCATT ACGACGCGCT GTTCCTAACC GTGGACCATA AGTTCTCGAA CCACTTCAAC
CTGCATTCGG CTTATACGCT TTCGAAGTCG TTGAACTATG CCAACGACGA CCAGATTCCG
TTTGCGAATG GGCCGATTGA TCCGACAGAT CTGCATCGCG AATACGGACC GACGCCGAAT
GACCAGCGGC ATCGCTGGGT AACGGCCGCG ACGGTTTCGT TGCCGTATGG AATCCAGTTC
TCGCCGTTGT GGACGCTGGC TTCGGGTGTG CCGATGGATA TCCAGCTTCC CGATGGAAGT
TCACGTGTGC CGGAGATGCA ACGCAACGCG GGTGGGCGCG AGTTCCACAA TGCAGCGGAG
CTGAATGCGT TCATCACGCA GTTGAATGCG GCGGGCGGAT CGAACGGAAC GTTGCTTCCG
CTGGTGAGTC CGAATGCCAA GTTTGGCGAT TCGTTCAACT CGTTCGATAT GCGGTTGTCG
AAGACGTTCC GGTTGGGAGA CAGGATGTCG CTCGAAGTGC TGGGCGAATG CTTTAACGTC
TTCAACACGA CGAACGTGCT GGGCGTATCG AATACGAATT ACTCGGGATA CAACAACGTG
TTGGTGCGCG ATAGTAACGA TCCGACCAGC GCTGGATACC TCACGTCGTC GACGTTCGGC
ATGCCGAAGA CAACGGCGGG TGGGGTGTTT GGCTCGGGTG GCGCACGGGC CTTCCAGTTG
GCAGCGCGGT TTAACTTCTA G
 
Protein sequence
MKKIQVLVFA LLCALASFGQ ELGTATLTGT VVDPSGAVIG GAKVSAQNKA TGMQREALST 
KAGIFVFNDL APGEYDLTIK ADGFAVTTAT VRITVGQQAN LRAEVRVAQS EQHIDVSGAI
PLVETSSSVV DGVVDSKQID ALPLNGRNFL ELALLMPGNA PAPLYDPTKS DTVLVSSTGQ
LGRGQNITID ASDNNDDVVG GMLVNIPQDA VQEFQIATNR FSAELGRSTS SVVNILTKSG
TNDLHGTAAI FARDAALQAK PNSFGQSVGD TPPFRRDQYT GSIGGPVVKD KAWWFTSFEY
RDQMGGILVG ERDLATQSIH TTFAAAPLTD ALGTAKFDWA ISSKDTLSTR YSTESFEGTS
NSATDRALGT ASQTQASTNR FHDINTNWTR VISAALVNRA QFAVNLFQND TVANGTGPQI
NFPSIEAGSS YRVPQATHQQ RLLWGDTLDW THGKHNLKFG AQMQRVGSDF NLGVFQQGIV
NAVEDFPDFD RNGDGVVNDN DLLFAVGLVS HTPTRPLIIP DADNNYVALF AQDDWRVHPQ
LTLNVGLRWE LDTDVKNVGH YDEINPLAKP FLHGDRSADY TNFGPRIGFN WANKPGTLSV
HGGYGMYYDR IVLEIVSLER GEDGRALAID VHAGNVFPGY MNPDGTFIPG VTPTLADPFT
GFILPGAGAG GIYAISNQMQ NPMVQQFNLG VQWEFLKNWV VRADGMHDFG QHFIIGVPVG
TVYNPVVGGP DTVKILESAV NTHYDALFLT VDHKFSNHFN LHSAYTLSKS LNYANDDQIP
FANGPIDPTD LHREYGPTPN DQRHRWVTAA TVSLPYGIQF SPLWTLASGV PMDIQLPDGS
SRVPEMQRNA GGREFHNAAE LNAFITQLNA AGGSNGTLLP LVSPNAKFGD SFNSFDMRLS
KTFRLGDRMS LEVLGECFNV FNTTNVLGVS NTNYSGYNNV LVRDSNDPTS AGYLTSSTFG
MPKTTAGGVF GSGGARAFQL AARFNF