Gene Acid345_2731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2731 
Symbol 
ID4069422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3228955 
End bp3230139 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content61% 
IMG OID637984748 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_591806 
Protein GI94969758 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.838486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.60857 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGA TTCTGGAGCG TTCGGTCGAG ATCGAGGGAA TTCGCATGGA GGAGCCGGTT 
GATGGGATGC GGCTCATGGT GCGACATCGC AAGCTGGTGG GCGTGGTTGC GGCGATTGCG
GCAGCGGTCG CGACAGTGGC TGCGATGTTG CTTCCGCCTA CATATACAGC GACGGCGCGT
CTACTCCCGC CACAGCAATC GCAGTCGCTC GCTGCGCTGT TCGTGGGGCA GACGGGGAAC
TCCCCGTTAG CGGCGATGGC GCAAAAAGAT CTTGGGTTGA AGAATCCTGC AGATCTTTAC
ATCGGCGTGT TGAATAGCCG CAGCGTCCAG GACGGATTGA CGAAGCAGTT TGAGTTGGCC
ACCGTTTATG GATTGAAGCG CCCTTCGGAT GTGCGCGCGC GGCTGGTCGA TCGAACCCGG
ATTCAGTTGA CGAAGGAAGG ACTGATCAGC GTCTCGGTTG AGGACCGTGA TGCGAACCGC
GCGGCGGGAC TCGCCAACGG TTACGCGGAA CAACTCCGGC TGGCGACGAA GCGACTGGCG
ATTTCGGAGG CAGCACAGCG GCGGAAGTTT TTTGATGAAC AAGTGCAGCA GACCCGTGAC
GAACTTGCGC GGGCAGAAAC GACATTTCGT GAAGTCCAGG AGAAGACCGG CATTCTGCAA
CTGGATGCGC AGGGGAAAGC GTTGATCGAA ACGGCGGCAA CACTGCGGGC GGAGATCGCG
GCCGGAGAAG TGCAATTGCG CGCGATGCGC AGCTTTGGAA CGGAGCAGAA TCCGGATGTG
CGTCAGCAGG AAGCGCAACT CGGTGGCTGG CGAGCGGAAC TGGCGCAACT GGAATCGCGA
CGGATGGGCG ACGGCTTCAG CAAAGGGCGC GCGCCTGCGG ATGCGCAGGA GTATGTGCAG
GCCATGCGCG AGGTCAGGTA TCGCGAGGCG ATGCTCGAGA TGCTGCTGCG GCAACTAGAG
GCAGCAAAGC TGGATGAGGC GAAGGAGGCG ACGATCGTCC AGGTAGTTGA TGTGGCGGTC
CCGCCAGATG TGCGGACATC GCCGAAACGG GCTGCCGTTG TGGTGTTCAG CACGCTTGGC
GCTGTGCTGG CGACAGCGGT ATGGCTGCAG CTGCGGCAGC GCTTTCTCAC GGATGTCGCG
TGGCAAGAAC GCTGGAGTGG GCTGCGCAAG GAGTGGGGAC CATGA
 
Protein sequence
MRAILERSVE IEGIRMEEPV DGMRLMVRHR KLVGVVAAIA AAVATVAAML LPPTYTATAR 
LLPPQQSQSL AALFVGQTGN SPLAAMAQKD LGLKNPADLY IGVLNSRSVQ DGLTKQFELA
TVYGLKRPSD VRARLVDRTR IQLTKEGLIS VSVEDRDANR AAGLANGYAE QLRLATKRLA
ISEAAQRRKF FDEQVQQTRD ELARAETTFR EVQEKTGILQ LDAQGKALIE TAATLRAEIA
AGEVQLRAMR SFGTEQNPDV RQQEAQLGGW RAELAQLESR RMGDGFSKGR APADAQEYVQ
AMREVRYREA MLEMLLRQLE AAKLDEAKEA TIVQVVDVAV PPDVRTSPKR AAVVVFSTLG
AVLATAVWLQ LRQRFLTDVA WQERWSGLRK EWGP