Gene Acid345_3410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3410 
Symbol 
ID4072746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4031695 
End bp4033500 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content62% 
IMG OID637985432 
Producthypothetical protein 
Protein accessionYP_592485 
Protein GI94970437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.581563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTG AACAGCTGGT TCAGGAACTG ATGGAAGCTT TGCTGAACGT CTTCGGGCGC 
GACGACAACG GCAACCGGCA GTACATGCTG GTGGAGACGT TCCCCGACTA CCTGATTGCG
CGCGGCCCGG ACGGCGACCT TTACCAGATC TCGTACACCG TGGGCGCCGA TGACGACGCG
ATCACCTTCA GCGATCCGCA GGAAGTGGAG ACGGCGTATG TGCCCGTTGC GGAAGCCGCG
ACCTTTCTCG CCGCGGTGGC AGGCGCGGAT CCGAACTCGA ATGTATTTCC GGTGGAAGTG
ATGCGCGCTG GCTTCGCCAA CGGAACCGTG CAATATGGCG GCAAGCCGTT GCGCCAGTTC
TTCCCGGAAG AAGTTGTCGC GCAGGTCGCA CAGGCGGTGA ATGGCGCGCG CTTCGGACGT
CGGCACCCAG TGGGCAATGA GAACGAAGGC AACCAGCCCG AGCTCGTCGC CGGCTGGCTT
GAGAACGGCC ACCTGGACGG CAACGGGAAG GCCGCACTTG CAGAAGTTCA CTTGCTCGAA
AGCGAGAAGG ACCTGCAGGC GAAGCTCGCG GCAGCACGAT CGGCGAACAA GCTCGATCTC
TTCGGCGTGA GCGTACTCGG CTACTTCGGA TTCCAGCCGA AGAAGATCGA CGGCGAAGAC
GTGCTCTACG CCACGCGCCT GGGCAAACTG GCGCGGGTTG ACCTGGTGGC TGAGCCTGCG
GCAGGCGGGC GATTTTTGAA TGAACTTCGC GTGGCGGCTT CGGCCGATGT CCGCGCGGAG
ATCTCGCGCA TGCAGCGCGA TGCAGTGAAG AGTGTACAAA ACGGCAGGGC GCGAGCCGCT
GGCCGACAAA AAGGAGAAGC GATGAAGGAT CGCATCTTGA AGTTACTGGC GGCGATGCGC
AAGCACAACG CGGCGAGCGC CGACCAGCTG ACGGTGGAGT TCAATGGCCT GTCGGAAGAC
AAGCACCAGG ACTTCCTGAT GAAAGTAACC GAAGCGGCTC TGACGATTGA GCCCGCTTCC
GCAACCGCTG CCAACAGCGA AGTGATCGAG CTGGCGAAGA CCGCACTGGC CGAAGCCAAG
AAGATCCAGT CGGGCAACCT GGTGGAGCGC AAGCTGAAGG ACGCGAAGCT GCCGAAGCCC
GCCGAAGACC TGGTGCGCAC ACACCTGGCC GAGCGCGAGC TGACCGAGGC ACAGGTGGAC
ACCGAGATCA CGGCGACGCG CGCAGCTTTC GCGGCGTACT CCACGATCGG CCGGCCCACC
GGCTCGGCGA TTGTGCTTGG TGGTGCAGAC AGCGTGGAGA AGGTGCAACT GGCGATGGAC
CGCCTGCTCC GCGTGAAGGA AGCGGAGAAG AGTGGTGTCG CTCCCTTCCG GGGCATCCGT
GACGCCTACG TGACGATCAC GGGCGACCGT GAGCTGAAGT TCGCCGACGG CGGCTTCTAT
CGCGCGAGCG AAGCGATCGC CACCGGCGAT TTCCCGAACA TCCTGCTGAA CTCGATGACG
AAGAAGTTGG AACAGGACTA TGCCGAAGTC GGCATGAGCG GCCTGGACCA GATCATCACC
AAGGCGAACA TCAACGACTT CAAATCGCAG GACCGCGTGC GGTTGGGCTA CCTCGGTGAT
TTGCCGACGG TCGCGGAAGC CGGACCTTAC ACGGAACTGA CCAAGCCCAC GGACGAAAAG
ATCAGCTACT CGGTGCTGAA GAAGGGTGGA ATCCTGACGA TCTCCGAAGA GACGATCCGC
AATGACGATC TCGGAAAGAT CGCAGCATTC CCCACGCGCC TGGCGCGCGC CGGACGCACA
CTTTGA
 
Protein sequence
MSFEQLVQEL MEALLNVFGR DDNGNRQYML VETFPDYLIA RGPDGDLYQI SYTVGADDDA 
ITFSDPQEVE TAYVPVAEAA TFLAAVAGAD PNSNVFPVEV MRAGFANGTV QYGGKPLRQF
FPEEVVAQVA QAVNGARFGR RHPVGNENEG NQPELVAGWL ENGHLDGNGK AALAEVHLLE
SEKDLQAKLA AARSANKLDL FGVSVLGYFG FQPKKIDGED VLYATRLGKL ARVDLVAEPA
AGGRFLNELR VAASADVRAE ISRMQRDAVK SVQNGRARAA GRQKGEAMKD RILKLLAAMR
KHNAASADQL TVEFNGLSED KHQDFLMKVT EAALTIEPAS ATAANSEVIE LAKTALAEAK
KIQSGNLVER KLKDAKLPKP AEDLVRTHLA ERELTEAQVD TEITATRAAF AAYSTIGRPT
GSAIVLGGAD SVEKVQLAMD RLLRVKEAEK SGVAPFRGIR DAYVTITGDR ELKFADGGFY
RASEAIATGD FPNILLNSMT KKLEQDYAEV GMSGLDQIIT KANINDFKSQ DRVRLGYLGD
LPTVAEAGPY TELTKPTDEK ISYSVLKKGG ILTISEETIR NDDLGKIAAF PTRLARAGRT
L