Gene Lcho_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_4056 
Symbol 
ID6162889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4542212 
End bp4543594 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content77% 
IMG OID641666834 
Producthypothetical protein 
Protein accessionYP_001793073 
Protein GI171060724 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000056515 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGCCC AGCCCCCCCG CCCCGGTGCC GGTCCGAACG ATCCGCACGA TCCGTTCGCG 
GCGCTCGACA GCGGCGCCAC CGTCATCCGC CCCAACCCCG GCGCCCGCGC CGCGCCGGCG
CGCATGCCGG CCGCCCAGGA GCCGCCGGCC GCCGACACGC CGCTGCCGGC GCAGGGCCTG
AATCCGCTGG TCAGCCTGGC CAACCGCCTG CTGCTGGCGG TGCCGCAGCT GCGCGCGACC
CGCCACGTCG CCGACCCCGC CGCGCTCAAG CACAGCCTGG CGCAGGCGGT GCGCGACTTC
AGCACCGCCG CCGCCGCCGC CGGCATCACG CCGCAGCAGG TGATGGCCGC GCGCTACGTG
CTGTGCACCG TGCTCGACGA GGCCGCCAGC GACACGCCCT GGGGCGGCGC CGGCGTGTGG
GCCCAGCACA GCCTGCTGGT GACCTTCCAC AACGAGGCCT GGGGCGGCGA GAAGGTCTTC
CAGCTGATGG CCCGGCTGGC CGGCCAGCCG GCCGAACACC GCGACCTGCT GGAGCTGATC
TACGCCGCGC TGGCGCTCGG TTTCGAGGGG CGTTTCCGCG CCATCGAGAA CGGCCGCGCC
CAGCTCGACG CGGTGCGCGA CAAGCTCGCG CGCATCGTGC TGCAGGCGCG TGGCGACCAC
GCCCCGGCGC TGGCGCAGCA CTGGCAGGTC GAGGCGGTGG CGCAGCGCGC GCTGCCGGGC
TGGCTGCCGC TGCTGGTGAC GGCGCTGGTG CTGGGCCTGC TGCTGGTGGC CGCCTACATC
GGCCTGAGCT TCTGGCTCGG CGCGCGCTCC GACCCGGTGT TCGGCCAGAT CCAGGGCCTG
CGCCTGAACC CGCCGGTGGC CGCGGTGGCG CAGCCGGCGC CGCAGCCGCG GCTGGCGACC
TTCCTGCGGC CCGAGATCGC CGAGGGCGCG GTGGTGGTGC GCGACGAGGT CGACCGCAGC
GTCGTCACGC TGCGCGGCGA CGGCCTGTTC GAGCCCGGCA GCGCCACGCT CGCCGCGCCC
AAGGAGGCGC TGCTGCGGCG CGTGGCCGAC GCGCTGGCGC AGTTCGGCGG CGCGGTGCTC
GTCACCGGCC ACACCGACAG CCAGCCGATC CGCTCGGCGC GTTTCCCGTC CAACTGGCAC
CTCTCGCAGG AGCGCGCCGG CGCGGTGCGC GAGCTGCTGG TGAGCCAGCA GGTGGCCACC
GAGCGGGTGC GCGCCGAGGG CCGCGCCGAC GGCGAGCCGG TGGTCGCCAA TGACAGCGCC
GGCAACCGGG CGCTGAACCG GCGCGTCGAG ATCACGCTGT TCGTCGCCGC GCCGCCGGGC
GTGGCCACGG CCGCCAGACC CGCAACCCCC ACGGCCACGC CCGCATCCGG AGCCCGGCCA
TGA
 
Protein sequence
MSAQPPRPGA GPNDPHDPFA ALDSGATVIR PNPGARAAPA RMPAAQEPPA ADTPLPAQGL 
NPLVSLANRL LLAVPQLRAT RHVADPAALK HSLAQAVRDF STAAAAAGIT PQQVMAARYV
LCTVLDEAAS DTPWGGAGVW AQHSLLVTFH NEAWGGEKVF QLMARLAGQP AEHRDLLELI
YAALALGFEG RFRAIENGRA QLDAVRDKLA RIVLQARGDH APALAQHWQV EAVAQRALPG
WLPLLVTALV LGLLLVAAYI GLSFWLGARS DPVFGQIQGL RLNPPVAAVA QPAPQPRLAT
FLRPEIAEGA VVVRDEVDRS VVTLRGDGLF EPGSATLAAP KEALLRRVAD ALAQFGGAVL
VTGHTDSQPI RSARFPSNWH LSQERAGAVR ELLVSQQVAT ERVRAEGRAD GEPVVANDSA
GNRALNRRVE ITLFVAAPPG VATAARPATP TATPASGARP