Gene Acid345_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0638 
Symbol 
ID4069576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp783336 
End bp786545 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content56% 
IMG OID637982644 
Producthypothetical protein 
Protein accessionYP_589717 
Protein GI94967669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value4.6909e-05 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0437117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAA ACGAGCGCGT GCAGCTATCA GAGCGCATCC CAGGCACCGT GCGTCAGGTC 
CTACTTTTGG TTCTTTTCTG TTTGTTAGCT GTGCCGTTAT TTGCGCAGTT TGATACCGGC
ACGATTACCG GAACCGTGAC TGATTCCTCC GGCGCCGCAA TCCCCAGCGT AAAAGTCACT
GTGACCAACA CAGGCACCAA CGTTCAGAAG ACGGTGACGA CGAACGCGAC CGGATATTAC
GTCGCGTCGG AATTACCGGT CGGAAATTAC GTAGTGAGTG CGAATTCGAC CGGCTTTGCC
GAGACAAAAA GCCAGAGTGT TGTGTTGAAC GTAGGCGCGG TCGTACACGC GAACCTGGCA
ATGGCAGTCG CTGGGAGCGA GCAAAAAGTT GAAGTGACTG GCACCACGAC GTCCGTTGAC
ACCGAGACGG CCCAAAGCGG CACCACGTTG AACGCGACGC AGGTGGCGAA CCTTCCGATC
AACGGGCGCG ACGTGAGCAA CTTCCTTGAG ATCGCTCCGG GGTCGGTGGC TTCAACCACA
TTCTTCCAAG GGAGCGTCAA CGGGCTTGAG AACATTTTCA CCGGTCTGAA CATTACGGTT
GACGGCCAGA ATGCCTCACG CGGAGACATT AACGGATTCC TCGATACGGA AGGCCAGGAA
CTGGCGCGGG TCACGCGCGC GAGCGTCGAC AGCATCCAGG AAATTGACTT TACTAACAGC
GGTTTCAGCG CTGAAGCGGG ACGGTCACTG GGCCCGCAGA TGAACATCAT CACCAAGTCG
GGCACGAACG ACTTCCACGG GACAGCGTTT GAGTTCCTGC GGAACGACGC ACTGGATGCC
AAAGACTACT TCAATAACGG CAAGGCTGCC CCACTGAGGA TGAACCAGTT CGGCGGCAAC
ATCGCCGGTC CAATCATCCG CAACAAGCTC TTCTTCTTCG CGAATTACGA GGGCGATCGC
ACGCACATTA CGAACTTCAA TGCGCTGTAC GAGACGCCAA GTGCGTATAT GCGTTCGAGG
CCACATGACC CTCGGATGGA TCCTGTCTTC GCGCAACTGG CGCCGGTTCC GGTGGGATGT
ACCAGTATTC CTGCGCCGGC TTCCTGCGCA GTGCCAGGAA CCACGGATAC CACCGACGCG
ACAAGTACCG CGGGCGGCGC GCAGCTCGTG TATGAAACCG CGTCGTTGCC CGATATTCTT
CGCGAAGACA CCGGGTCTGT GAAAGTTGAT TGGAACATTT CGGACAAAGA TCGGATTTTC
TTCCGCTACA ACATCAATGA TTCGTTAACG GATTACACGT ATGGATTGAA CCAAGGACAA
GTTTCGCCGC AATCCATGCG GACGCAGTTG TTGAAGGTGG ATGAGACGCA CACCTTCAGC
CCGACATTGC TGAACCAGCT GAGCTTGGCG TTGAACCGGT TCTACTCAAA CACGGACTCG
AATACGCCAG AACCATTTGT GGGGTTCAGC GGCTTCTTCA CTAACCTCGG CAATCTGCCG
GGGCCGAACA CGTTCAACCA GATCACGCCG TTCAATGTGT TTGAAGTGTT TGACAACGTC
ACGAAGACGT CGGGGCGTCA TACGCTGCAT TTTGGCGCAC AGATTCGCGC CAACCAGTTG
AACGAGTGGT TGCGTCCTCA GCAGGTGTAC CAATTCGGCG GAGCAAACAT CTTTCAGCCG
GATATGTTCC ACGATTTCCG GACCGATAAC CCGTTCGTTT TGCAAAAGAT CGGGTTCCCG
GGATTTGTGG GCGTGACCAA CTCAAATTGG GGTCTCTATC TCCAGGACGA TTGGAAGGTA
ACGCGCACCC TGACGTTGAA TATCGGCGTT CGTTACGAGT ACAACACCGC GTGGAGCGAG
CGTCACAACA TCGAGCAGAA TTTCGATTTC GCGACGCAGG CCTTCCTGCC GCAAAACCAG
GACATCTACA ACGCGCCGAA GGGTGATGTT GCGCCGCGCC TCGGCTTCTC GTGGGACCCG
TTCGGCACTG GAAAGACGGT GGTACATGGA TACTACGGTC TGTTCTATAT GCCGATGCAG
TACGGCTTCG GGATGATCGG GAACATTCCC GATTACCAGA GCTATAGCGT GAATGTGTTC
CAGGCGATTT TCGGAAATCC TCCGTTCTCG ATCGCCTATC CGTCACCCAA CCCGCCGTTG
CAGCCAGGAA CGCAAAACGT AAATATCTTC CCGAGTAATC CTCAGGATCC TTTCTCCGAG
AACTGGATGT TCGGGATCGA GCAGGAGATT GCCCCAAACA CGGTTCTGGC CTTGAACTAC
ATTGGCAACC ACGCTATGCA CATGCAGGCG GGCGTTTCGT TCGCGAATGT GAACCTGAAT
CCCGCTAACC CATTCACGCA GGCGCGTCCG CTATCGGGAT ACGCGAGCGA GAACTATCTG
TGCGATTGCC TGTTCTCGAA ATACAACTCA CTGCAAGCGC AAGTGCGGCA CAACATCGGG
AAGTTGAATT TCGAAGCGAA CTATGTCTGG TCGCACGAGA TTGACGACCA GATGAACTTC
CTGAGTCCCG GGTATACCAA CCCGGCCGAC CCGAAGGCTG ACATTGCCAG CGGCGATTGG
GACGTGCGTC AGAACCTGAC AGGCAGCGTG GTGTATGCGT TCGGAAACCT GAAGGGAGAA
GCGGCGTGGA AACGGGCCAT CCTGGGCGGA TGGCAGGCTT CAACCATTCT CCAGGCGCGG
TCTGGCTTGC CGGTGAACAT CACGCTGGTG AGCGGATTGT TTGGGAACCC GACGCGTCCG
AATCGCGTGG CGGGACAACA AGGCTACCTG TCGGACATAA ATTGGCCGAA CAGCAGCTTC
AACTCCGCCG CTTATGAGAT CAATCCTAAC TACACCGGCG ACTGGGGCCC AGTGTGGGGC
AACACGGGAC GGAACGACCT GCGTGGTCCT GCCTTCGTGC AGTGGGACAT GTCGGGGATG
AAGAACATTC CGATCACGGA AAGGGTAAAC CTGCAATTCC GTGCGGACAT CTTCAACATC
CTGAACCACC CGAACTTCGC AACGCCCGAT GGTGGAATCT GTAGCTCGAT CACTGGGGCG
TTCACGGATG CCTCTGGATT CCATCCGGCA ACCTGCGCGC CGAATCCGAA CTTCGGCCGC
ACCGGACAGA CAGTTGCTGA CTCGAACACA AGCCAGATCG GACCGGGAAC CAACCGGCAG
ATCCAGCTCT CGCTGAAGCT GGTTTTCTAG
 
Protein sequence
MTTNERVQLS ERIPGTVRQV LLLVLFCLLA VPLFAQFDTG TITGTVTDSS GAAIPSVKVT 
VTNTGTNVQK TVTTNATGYY VASELPVGNY VVSANSTGFA ETKSQSVVLN VGAVVHANLA
MAVAGSEQKV EVTGTTTSVD TETAQSGTTL NATQVANLPI NGRDVSNFLE IAPGSVASTT
FFQGSVNGLE NIFTGLNITV DGQNASRGDI NGFLDTEGQE LARVTRASVD SIQEIDFTNS
GFSAEAGRSL GPQMNIITKS GTNDFHGTAF EFLRNDALDA KDYFNNGKAA PLRMNQFGGN
IAGPIIRNKL FFFANYEGDR THITNFNALY ETPSAYMRSR PHDPRMDPVF AQLAPVPVGC
TSIPAPASCA VPGTTDTTDA TSTAGGAQLV YETASLPDIL REDTGSVKVD WNISDKDRIF
FRYNINDSLT DYTYGLNQGQ VSPQSMRTQL LKVDETHTFS PTLLNQLSLA LNRFYSNTDS
NTPEPFVGFS GFFTNLGNLP GPNTFNQITP FNVFEVFDNV TKTSGRHTLH FGAQIRANQL
NEWLRPQQVY QFGGANIFQP DMFHDFRTDN PFVLQKIGFP GFVGVTNSNW GLYLQDDWKV
TRTLTLNIGV RYEYNTAWSE RHNIEQNFDF ATQAFLPQNQ DIYNAPKGDV APRLGFSWDP
FGTGKTVVHG YYGLFYMPMQ YGFGMIGNIP DYQSYSVNVF QAIFGNPPFS IAYPSPNPPL
QPGTQNVNIF PSNPQDPFSE NWMFGIEQEI APNTVLALNY IGNHAMHMQA GVSFANVNLN
PANPFTQARP LSGYASENYL CDCLFSKYNS LQAQVRHNIG KLNFEANYVW SHEIDDQMNF
LSPGYTNPAD PKADIASGDW DVRQNLTGSV VYAFGNLKGE AAWKRAILGG WQASTILQAR
SGLPVNITLV SGLFGNPTRP NRVAGQQGYL SDINWPNSSF NSAAYEINPN YTGDWGPVWG
NTGRNDLRGP AFVQWDMSGM KNIPITERVN LQFRADIFNI LNHPNFATPD GGICSSITGA
FTDASGFHPA TCAPNPNFGR TGQTVADSNT SQIGPGTNRQ IQLSLKLVF