Gene Lcho_4051 details


Gene Information

Locus tag: Lcho_4051
Symbol:
ID: 6163238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4537492
End bp: 4538628
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 71%
IMG OID: 641666829
Product: protein of unknown function UPF0052 and CofD
Protein accession: YP_001793068
Protein GI: 171060719
COG category: [S] Function unknown
COG ID: [COG0391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01826] conserved hypothetical protein, cofD-related
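
The Start bp, End bp, Gene Length and Protein Length values in the record above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information record.
# Assumption: coordinates are 1-based and inclusive on the replicon.
START_BP = 4537492
END_BP = 4538628


def gene_length(start: int, end: int) -> int:
    """Number of base pairs spanned by inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1137 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields listed in the record.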


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000160133
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACGCA TCGTCATGTT CGGCGGCGGC AGCGGCAGCC GCGACATCAC CATGGCCCTG 
TGCCGACAGC GCTACGAGGT CACCCGCGTG GTGCCCGCCT GGGACAGCGG CGGCAGCTCC
AGGGCCTTGC GGGCCGCGCT CGGCATCCTG GCGATGGGCG ACATCCGCCA GGCCCTGATG
ACGATGGCCC ACGGCGAAGG CCGGGTCAGC AGCGTGGTGC GCTTCTTCAA CGCCCGCCTG
TCCGAGACCT CGAGCCAGCC CGATCTGCTG GCCGAATTCG ACTTCTACGT CAGCGGCGCC
CACCCCCTGC TGGCCACGAT GGAGCCCGGC ATCCGCGGCG CGATCCTCAA CTACCTGCGC
GTGTTCCAGT CGAACATCGC CGGCGACTTC GATTTCTGCC GCGGCAGCAT CGGCAACTTC
GTGCTGACGG GGGCCTACTT CGCCCACGGT CGCGACATCA ACACCGCGAT CTTCGTGTTC
CGCAAGCTGT GCGCCATCGA CGGCCACGTG TGGCCATCGA CCGCCGACGA CACGGTCGAG
CTGCGTGCCG TGCTGCGCGA CGGCCAGGTG GTGCGCGGCC AGGAGCGCAT CACCGACCTG
AACGCCGAAC AGGCGCAGGC GGGCATCGAG CGGGTCGAGC TGCTGCACGC GGGCGACGGC
AGGCCCGCGG CTTCGCGCCC GGCGGCCAAT CCGGCGGTCC TGGAGGCGAT CGGCACGGCC
GACCTGATGC TGTTCGGCCC GGGCAGCTTC TACACCAGCA CGCTGCCGCA CCTGTCGGTG
GCGGGCATCG CCGAGGCGAT CCGCGCCGCG CCGCCGCAGG TTCCCAAGGT CTTCGTCGGC
AACATCCTCG AATGCCCCGA GACGATCGGC GGCACTGTGG CCGAGCAGGT GCGCGCGCTG
CTGCAGGCCG GCGGACCCGG CTCGCTGACC CATGTCCTGC TCAACCGCGG CTGGGTGCCG
TTCGAGCGCG TGGCCAAGGG CTTTCGCTAC CTTCACGAGG GGGTGCTGCC CGAAGGCGGG
CCCGGGCTCC TTGCCGATGA TTTCGAGGAC CCGTGGCACC GCGGCCGGCA CGACGCACCC
AAGGTGGTCG AGCTGCTCGG CGAGCTCATC ACCCGGTCCG GTCCGGCGCC GAACTGA
 
Protein sequence
MKRIVMFGGG SGSRDITMAL CRQRYEVTRV VPAWDSGGSS RALRAALGIL AMGDIRQALM 
TMAHGEGRVS SVVRFFNARL SETSSQPDLL AEFDFYVSGA HPLLATMEPG IRGAILNYLR
VFQSNIAGDF DFCRGSIGNF VLTGAYFAHG RDINTAIFVF RKLCAIDGHV WPSTADDTVE
LRAVLRDGQV VRGQERITDL NAEQAQAGIE RVELLHAGDG RPAASRPAAN PAVLEAIGTA
DLMLFGPGSF YTSTLPHLSV AGIAEAIRAA PPQVPKVFVG NILECPETIG GTVAEQVRAL
LQAGGPGSLT HVLLNRGWVP FERVAKGFRY LHEGVLPEGG PGLLADDFED PWHRGRHDAP
KVVELLGELI TRSGPAPN
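
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content field can be checked by counting bases. The sketch below is one way to do this in plain Python. It assumes the gene sequence is given 5' to 3' on the coding strand and starts in frame at its first base. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code, so the sketch uses that codon table. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, in the 10-base groups shown above.
print(translate("ATGAAACGCA TCGTCATGTT CGGCGGCGGC"))  # MKRIVMFGGG
```

Passing the complete gene sequence to `translate` and comparing the result with the listed protein sequence (after removing the display spacing) gives a full consistency check. `gc_percent` on the same input can be compared with the GC content field.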