Gene Lcho_3807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3807 
Symbol 
ID6160486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4271798 
End bp4272865 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content69% 
IMG OID641666580 
Productsulfate ABC transporter, ATPase subunit 
Protein accessionYP_001792826 
Protein GI171060477 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1118] ABC-type sulfate/molybdate transport systems, ATPase component 
TIGRFAM ID[TIGR00968] sulfate ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.772256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCG AAGTCCGCAA CCTCAACAAG CGCTTCGGCA AGACCGTCGT CTGCGACAAC 
CTCAACCTCG ACATCCCGTC GGGCGAACTG GTCGCGCTGC TGGGCCCGTC GGGCTCGGGC
AAGACCAGCC TGCTGCGCAT CATCGCGGGG CTCGAAGTGC CCGATTCGGG CAGCGTGCTG
TTCCACGGCG AGGACGCCAC CCACACCGAC GTGCGCGAGC GCCAGGTCGG TTTCGTGTTC
CAGCATTACG CGCTGTTCGC GCACATGACG ATCTTCGAGA ACGTCGCTTT CGGCCTGCGC
GTGCGACCCA AGGTGACGCG GCCGAGCGAT GCCGAGATCC GCCGCAAGGT CACCGACCTG
CTGCAGCTGG TACAGCTCGA CTGGATCGCC GACCGCTACC CGCACCAGCT CTCCGGAGGC
CAGCGCCAGC GCATCGCCTT GGCGCGCGCG CTGGCGGTCG AGCCCAAGGT GCTGCTGCTC
GACGAGCCCT TCGGCGCGCT CGACGCCAAG GTGCGCAAGG AACTGCGGCG CTGGCTGCGC
CGGCTGCATG ACGAGGTGCA TGTCACCAGC GTCTTCGTCA CGCACGACCA GGAAGAGGCG
ATGGAAGTGG CCGACCGCAT CGTCGTGATG AACCAGGGCC GCATCGAGCA GGTCGGCAGC
CCGGACCAGG TCTACGACCA CCCGGCCACG CCCTTCGTGC TGCAGTTCCT GGGCGACGTC
AACCTGTTCC ACGGCCGGCT CGGCCATGCG CCGGGTGGCA GCACGTCGGC CGCCGAGGTC
AGCTACGTGC GCCCGCACGA ACTCGAAGTG ATCGGCGCCC CCGAGGCCGA CACGCTGGCG
GTGACGCTGA GCCAGGCACT CACCGTCGGC CCGAGCACGC GGCTGGAGTT CAAGCGCGAG
GACGGCAGTT ATGTGGACGT GGAACTGCCG CGCGCGCAGT GGCAGCTGCT GCGCGAACGC
CTGGGCCTGG CCAACGGCAG CCGCGCGTGG CTGAAGGCGC GGCGGGTGAC GCGCTTCGTG
GCCGGTGGCG AGCCGGCCAC GGCGGACGAT CCGGCCGCGA TGATCTGA
 
Protein sequence
MSIEVRNLNK RFGKTVVCDN LNLDIPSGEL VALLGPSGSG KTSLLRIIAG LEVPDSGSVL 
FHGEDATHTD VRERQVGFVF QHYALFAHMT IFENVAFGLR VRPKVTRPSD AEIRRKVTDL
LQLVQLDWIA DRYPHQLSGG QRQRIALARA LAVEPKVLLL DEPFGALDAK VRKELRRWLR
RLHDEVHVTS VFVTHDQEEA MEVADRIVVM NQGRIEQVGS PDQVYDHPAT PFVLQFLGDV
NLFHGRLGHA PGGSTSAAEV SYVRPHELEV IGAPEADTLA VTLSQALTVG PSTRLEFKRE
DGSYVDVELP RAQWQLLRER LGLANGSRAW LKARRVTRFV AGGEPATADD PAAMI