Gene Acid345_3824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3824 
Symbol 
ID4071108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4520419 
End bp4521639 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID637985847 
Producthypothetical protein 
Protein accessionYP_592898 
Protein GI94970850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.372888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.874565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGATCG GGGAACTCAT CCGGTACGAG CCGTTTGGAG AGCGCTTCGA GGAGACGACT 
GCGAGGTTTT TGAAGGCACG TTTTGGTGGT GATTGGAAAG TGCGCTGGAG TCCCGGGCGC
GTGGGAACGG TGCCAGGCGC GCAGCAATGG CTGGTAAATT ACGAGATCAA CTCGGTGTTC
CATCCCACTG CGAGAGCGAA TGTGTTTGAT GTAGTACGCC GCGAGTTCTC GTCTAGCCCG
GTACGGTGGA AGCGGCCGTT GCAGCGGATG TATTTTGGGG CATCGGTTTC GAAAGTGTTC
GCGCCGGGGA TGGCGCACGC GCGAGTTGAT ATTTCGCCGG CGGTCCCCGA TCCGCAGAAA
TGGCTGATTG TGCCGGGGAC GCACAAGGTG AGGTACATCG ACACCGAAGA ACGGCGTGTG
TACTGCCATT TGAAGCACGG CTCTCGGATG GATCGTTTTG CGAAGGAAAT CGAGGCACGA
AGGTCGGCCT CGGGCGCTGG AGTGGCGGTG CCGGGGATCG TTGGCGAGCT TGGTGAAGAG
TGCGTGATCG AGGAGATGGT CGTCGGGACG CCACTGAATC GGCTGTCCGA CGCTAAACTG
CAGCAAGATT GTGTACTGCA GGCGAAGAGT TCTATGCAGC CTCTGTACGA CGCGACGGTA
TGCCAGGAAC AGCAATCGGA ATACGCGAAA CGACTCTCGG GGGAGATCGC CGCAGCGGTT
GCCGGTACGA GGATTGTAGC ATCGCTTCGC GATACAATCC TGAGTGCGGT TGAAAACATT
CAGGATTGCC TGCAAGACCC CTCGGTCGTA CAAACAGTCC AGAGTCACGG CGATTTCCAG
CCAGCGAACA TTCTCTGGGA CGGCCAGCGG GTTTGGATCA TTGACTGGGA ATACTCGGGG
CGACGTCAGC GTGACTACGA TGCGCTGGTT TACGCGTTGC AGTCGCGTTT TGCACGGGGA
ATCGCTGCCA GGACGCGAGT GTACTTAAAG GGAATTGCTA CACGGGAGCG GGCAGAGGCT
GTCGCTCGAA TTCGTTGTTT CTTGCTCGAA GAGTTTGCGT TTCGCTGTGA AGAACTCACC
GCCGCGACGC ATGAGGTAAT TGCGCCGACA TTTCTCGAAC TGCTCGAAGA AGCAGAGGAG
ATGCTCGTCG TTTTAAGCGA AGAGGAAAAT GCGGGGAAAT CCAAAGTATC TGTTTTTCAA
ACAAACGGCG CCGTTTCGTA G
 
Protein sequence
MKIGELIRYE PFGERFEETT ARFLKARFGG DWKVRWSPGR VGTVPGAQQW LVNYEINSVF 
HPTARANVFD VVRREFSSSP VRWKRPLQRM YFGASVSKVF APGMAHARVD ISPAVPDPQK
WLIVPGTHKV RYIDTEERRV YCHLKHGSRM DRFAKEIEAR RSASGAGVAV PGIVGELGEE
CVIEEMVVGT PLNRLSDAKL QQDCVLQAKS SMQPLYDATV CQEQQSEYAK RLSGEIAAAV
AGTRIVASLR DTILSAVENI QDCLQDPSVV QTVQSHGDFQ PANILWDGQR VWIIDWEYSG
RRQRDYDALV YALQSRFARG IAARTRVYLK GIATRERAEA VARIRCFLLE EFAFRCEELT
AATHEVIAPT FLELLEEAEE MLVVLSEEEN AGKSKVSVFQ TNGAVS