Gene Lcho_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3843 
Symbol 
ID6161958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4306699 
End bp4307952 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content65% 
IMG OID641666616 
ProductNO3-/NO2-ABC transporter 
Protein accessionYP_001792862 
Protein GI171060513 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC CCCAAGACCC GGTGACGCCC GTGAACCTGC CGCGACGCGA TTTTCTTCAA 
CGTGCTGCCG CCGTGTCCGG CGCGCTGGCC GTGCCCGGCG GCGCGTGGGC GGCCGGCTCC
GATGCGCCCG AGAAGAAGGA GGTGCGCATC GGCTTCATCC CGCTGACCGA CTGCGCCTCG
GTGGTGATGG CCTCGGTGCT GAAGTTCGAC GAGAAGTACG GCATCAAGAT CATCGCGAGC
AAGGAAGCTT CCTGGGCCGC CGTGCGCGAC AAGCTGGTCA ACGGCGAGCT CGACGCCGCG
CACGTGCTCT ACGGCCTGGT CTACGGCGTG CATCTGGGCA TCAGCGGCCC CAAGAAGGAC
ATGGCCGTGC TGATGACGCT CAACAACAAC GGCCAGGCGA TCACGCTGTC GAAGAAGCTG
GCCGACAAGG GTGCGGTCGA CGGCGCCGGG CTGGCCAAGC TGATGAAGGC CGAGCCGCGC
GAATACACCT TCGCGCAGAC CTTCCCGACC GGCACCCACG CGATGTGGCT GTACTACTGG
ATGGCGGCCA ACGGCATCAA CCCGATGACC GACGCCAAGG TCATCGTGGT GCCGCCGCCG
CAGATGGTGG CCAACATGCG CGTGGGCAAC ATGGACGGCT TCTGCGTCGG CGAGCCCTGG
AACCACCGCG CCATCATGGA CGGCATCGGC GTGACCGCGG TCACCACGCA GGACATCTGG
CGCGACCACC CCGAAAAGGT GCTGGGCGCG ACCAACGACT TCGTCACCAA GAACCCGAAC
ACCGCCCGCG CGATGGTGAT GGCCATCCTC GAGGCCAGCC GCTGGATCGA CACCGGCCTG
CAGAACAAGA TGAAGATGGC CGAGACGGTG GCCGAGAAGT CGTACATCAA CACCTCGGTC
GACGCCATCA ACCAGCGCAT CCTGGGCCGC TACCAGAACG GCATGGGCAA GACCTGGGAC
GACCCGAACC ACATGAAGTT CTTCAACGAC GGCGCGGTCA ACTATCCGTA CGTGTCCGAC
GGCGCCTGGT TCCTGACCCA GCACAAGCGC TGGGGCCTGC TCAAGGCCGA CGTCGACTAC
CTCGGCGTGG CCCGCGCGAT CAACAAGACC GAGATCTACA AGCAGGCGGC TGCGCAGGTC
AAGGTCAACC TGCCCAAGAG CGACATGCGC AGCAGCAAGC TGATCGACGG CGTGGTCTGG
GACGGCAAGG ATCCGGCCAA GTACGCCGCG GGTTTCAAGA TCAAGGTGGC CTGA
 
Protein sequence
MTMPQDPVTP VNLPRRDFLQ RAAAVSGALA VPGGAWAAGS DAPEKKEVRI GFIPLTDCAS 
VVMASVLKFD EKYGIKIIAS KEASWAAVRD KLVNGELDAA HVLYGLVYGV HLGISGPKKD
MAVLMTLNNN GQAITLSKKL ADKGAVDGAG LAKLMKAEPR EYTFAQTFPT GTHAMWLYYW
MAANGINPMT DAKVIVVPPP QMVANMRVGN MDGFCVGEPW NHRAIMDGIG VTAVTTQDIW
RDHPEKVLGA TNDFVTKNPN TARAMVMAIL EASRWIDTGL QNKMKMAETV AEKSYINTSV
DAINQRILGR YQNGMGKTWD DPNHMKFFND GAVNYPYVSD GAWFLTQHKR WGLLKADVDY
LGVARAINKT EIYKQAAAQV KVNLPKSDMR SSKLIDGVVW DGKDPAKYAA GFKIKVA