Gene Acid345_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1722 
Symbol 
ID4072067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2089650 
End bp2090915 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content61% 
IMG OID637983730 
ProductABC efflux system, outer membrane lipoprotein, NodT 
Protein accessionYP_590797 
Protein GI94968749 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.248885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGGAT CTTTGAGATT TGGGATCGTC GCGGTAGTCG CTGGTGCTCT GACGGGATGC 
ACGGTTGGCC CGCGGTATCA CGCGCCGGCG CGTCCGTCGG TGGGAACCTA CACGCCGGAG
CCGCAACCGA CGAAGACGGT GCAATCTGCG GGAAAAGACG GTGCGTCGCA AACGATCAAA
CGCTCGAGCG ACATTCCGGC AGAGTGGTGG ACGGTATTTC AGTCGCCGCA ACTGGACCGT
ATGGTGCATG AAGCGCTCGC GAACAGCCCG ACGCTTGCGC AGGCGAATGC TCGTTTGAAG
CAGGCACAAG AAGAGTCGAA TGCGCAGACG GGGGCGGCGA AGTATCCGAC GGTAAGCGCG
AACGGAAGCG CGACGCGGGA GAAACTCAAT CTCGCGACTT TTGGCGTACC GTTTCCCAGC
CCAGGGCCGT TCACGCTTTT GAATGGTTCG GTGGTGGTGT CGTATGCGCT CGACATTTTT
GGCGCGAAGC GGCGATTGAT TGAAGGCCTG AATGCGCAGG TGGAATTCCA GGAGTGGCAG
TTGCAAGGCG CGCGATTGAT GCTCGCAGGA AATGTGGCAT CCGCGGCGAT TCGGCAGGCA
GAAGTGCGGG CGCAGATGGA TGCGACGCGG AGCTTGCTGG CATTGCAGGA GAAGAGCGTA
AGTATCGCCG AGAAGCGCTA TGCGGCGGGG GGATTTTCTG AGTATGACGT GCGCAGTCAG
CAAAGAACCC TGGCGGAGAT ACAGGCGAGT TTGCCGCCGC TGGAGCAACA GCTCGACAGC
TTGAACCACC AGCTTGCATT TCTGATGGGA AAGACACCGG CGGAGGCGCA GGTGCAAGCT
TTGAGCCTCA ATGGTCTGCA TCTTCCGCAG GAGTTGCCGG TGAGCGTGCC GTCGGAAATG
GTACGGCAGA GGCCGGATAT CCGGGCGGCA GAGGCGCTGC TGCACCAAGC GAGCGCAAAC
GTTGGCGTGG CGACGGCGAA TTTATATCCG CAAATTACGC TTTCAGGAAG CGCTGGCGGG
ATCGGGACGA GCTTTACTGA GGGCGGCGCG GTGTGGAATG TTGGCGCATC GTTGAGCCAG
CCGATTTTCA ACGGCGGACA GCTGCGAGCG GAGAAGCGGA AGGCGGTTGC TGCCTACGAT
GAAGCGGGCG CGACGTATCG GCAAACGGTT CTGGAGTCAT TCCGCGAGGT GGCGGATGTT
TTGCGGGCGA TTGAGCACGA TGCGCAGACC CTGAAGTCGC GGACGGAGGC AGCGTCCCCA
AGCTGA
 
Protein sequence
MVGSLRFGIV AVVAGALTGC TVGPRYHAPA RPSVGTYTPE PQPTKTVQSA GKDGASQTIK 
RSSDIPAEWW TVFQSPQLDR MVHEALANSP TLAQANARLK QAQEESNAQT GAAKYPTVSA
NGSATREKLN LATFGVPFPS PGPFTLLNGS VVVSYALDIF GAKRRLIEGL NAQVEFQEWQ
LQGARLMLAG NVASAAIRQA EVRAQMDATR SLLALQEKSV SIAEKRYAAG GFSEYDVRSQ
QRTLAEIQAS LPPLEQQLDS LNHQLAFLMG KTPAEAQVQA LSLNGLHLPQ ELPVSVPSEM
VRQRPDIRAA EALLHQASAN VGVATANLYP QITLSGSAGG IGTSFTEGGA VWNVGASLSQ
PIFNGGQLRA EKRKAVAAYD EAGATYRQTV LESFREVADV LRAIEHDAQT LKSRTEAASP
S