Gene Lcho_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3042 
Symbol 
ID6161569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3359423 
End bp3360508 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content70% 
IMG OID641665817 
Producthypothetical protein 
Protein accessionYP_001792067 
Protein GI171059718 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00000069589 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGAGA CCACCCTCAG CCGCACGGCG ATCTTCAGCG CCCGCCCGAC GCTGCGCATC 
GCCGACCAGC CCGACGAGCG GCTGTCGACG CTGATGACCG CGCTGAAGAT GGACGAGTCC
GAAGGCGGCC TGAGCGCACT CGAACTGCAC CTGACCAACT GGGTGGCCAC CCCCGAGGGC
GGCGCCGAGC TGGCCTTCAA CGCCGACAGC AGCCTGCGCC TGGGCGCGGA CCTGGCGGTC
TATTGCGGTG ACGAGGCAAG CCCGCGCGAA CTCTTCAAAG GCAAGGTCAC GGCACTGGAG
ATGGTCTGCA ACTACGGCAC ACCGCCCGAA CTCGTGGTGC TGGCCGAGGA CGGCCTGAAC
GCCGCGCGAC GCAACCGCCG CAGCGAGGTC TACACCGACC AGAGCCCGGC CGACGTGGTG
CGCACGATCG GCGCTCGCAA CGGCCTCACG GTCAACGTCA ACGGCCTCGC CAGCCCGACC
GGCACCTGGG TGCAGCTCGA CGAAACCGAC CTCGGTTTTC TGCGCCGGCT GCTGGCGCGT
TTTGACGCCG ACCTGCAAGT GGTCGGCAGC GAACTGCAGG TGGCGGCCCG CCAGGATGCC
GCACGCGGCG AGATCGAGCT GACGCTCAAC AGCCAGCTGG CCCGCGTGCG CATCTGCGCC
GATCTGGCGC ACCAGGCCAG CGCCGTCAGC GTGGCCGGCT GGAATGCGGG CGACGGCAGC
GCCGTCAGCA GCGAGGCGAG CAGCCTGTCG AGCACCGGGC CGGGCTCGGG CCGCAGCGGC
ATCGACTGGG CGAAAGATGT CTTTGGCGAG CGCAGCGAGC ACCTCGCCAC ACCCGCGGTC
GGCAGCAACG ACGAGGCCCG CGCAGTCGCG CAAGCCGCGC TTGATCAGCG CTGCCGCCGC
TTCGTGCGTG CCGAGGGGCT GTCCGAAGGC AACGCGCAGC TGCGGGTGGG CAGCACCGTG
AAGCTGGTCG GCATCTCGGC GCAGTTCGAC AACCGCTACT ACGTGGTGCG CACCCGCCAC
CTGTTCGACA TGGAACAGGG CTACCGCACC GAATTCAGCG CCGAGTGCGC CTACCTCGGC
GGCTGA
 
Protein sequence
MSETTLSRTA IFSARPTLRI ADQPDERLST LMTALKMDES EGGLSALELH LTNWVATPEG 
GAELAFNADS SLRLGADLAV YCGDEASPRE LFKGKVTALE MVCNYGTPPE LVVLAEDGLN
AARRNRRSEV YTDQSPADVV RTIGARNGLT VNVNGLASPT GTWVQLDETD LGFLRRLLAR
FDADLQVVGS ELQVAARQDA ARGEIELTLN SQLARVRICA DLAHQASAVS VAGWNAGDGS
AVSSEASSLS STGPGSGRSG IDWAKDVFGE RSEHLATPAV GSNDEARAVA QAALDQRCRR
FVRAEGLSEG NAQLRVGSTV KLVGISAQFD NRYYVVRTRH LFDMEQGYRT EFSAECAYLG
G