Gene Lcho_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3044 
Symbol 
ID6161571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3361205 
End bp3362872 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content70% 
IMG OID641665819 
Producthypothetical protein 
Protein accessionYP_001792069 
Protein GI171059720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000000101024 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCTGGTG AACTGCGCAG CGCCCACCAC GACGTCGTGA TCGGCGCGCC GCGCGACCTC 
GTCGCCACGC GCCGCCCGGC GCTGCTTCAG CGCAGTGACG ACGACTTCAT CGGCGCCACG
CTCGAAGCGC TGCGCGCACC GGCCGGCCGC GCGGCCTTGC GCGCCAAGCT CGCCCGTGCC
ACCGACGCAC AGGGCCGGCT CAAGCTGTTC CAGCCGATCC AGCGCCAGTT CCACGTCGCG
CTGATCGAGG CCTGGTGCGA CACCCCCGGC GCGCCGCGCA TCGACCCCAA GCGGGTCGAT
GCCGCCGGCC TGGTGGTGCG CCGACTCGGG CCTCAGGGCG CCGAAGGCTG GATGCACGCC
AACGGCCAGG TGCGCGGCTG GGTGCCGCTC GCACGCATCG GCGGCGAGAC GGCCGAACCG
CAGGCCGCCG CACGCTTGAC GGCGCGACTG ACCGGCGTGG CGCAGATCGA TCGGCAGCTC
AGCCTGCACG CCCGCGAGGA CGCTGACCAC CCGCTCGAAG AGCAGATCAG CGGCCTCTAT
CTGGCGCCGC CCGACGTCAA CGCCGACGCC GGCAAGACGC TGTTCTACGG GCTGGTGCCG
ACCACCAGCA GCGAGCTGAG CGACGTGCCG CCCGCACCCA CAGGCGACGA CGGTTTCGGC
GCCGGCTCGG ACGCCTTCCG CAACCACCTG GTCGAGGCGC TGCGCGGCGA CGCGATGGAC
TTGCCCTTCC CCGGCGATCT GCTCGTGCCC GGCTGGTTCG AGGCCAGCGA ATCACCCGGC
GTACGCCCAC CCGACGGCGT CACGCAAAAC CAGTTCAACC TGCTGCAACA GGACGTGCGC
ACCGCGGCCG GCGCGACCGC CTTGCGCATG CAGCGTTTCC TGCTGTTCCT GCGCCAGATC
GCGGGCGAGT TCAATGCCTT CGCGGGTGGC AACGAAGTCA AGACGCTGAA GACGATCCTT
GCCTCCATCC AGCTGCCACT GGTGCTGCGC CCGGATGAAA AGGTCGTGCG CTTCGTCCGG
GCCGATCAGT TCCTCGCCAA GGCCAGCGCG ATCCTGCTGG AGCAGACCGC CAGCGCTAGC
GTCGTCGAGA TGCCGGCTTA CTGGCCCGCG CTGGCCCCGG CCGACGCGAC CCGCCTGAAG
AACGCGCTGC ACCAGGCGCT GCAAGCCCGG CTGGCCGCGA TGCAGACCCA GGCCGGCCGC
TACGACGAGC CGACCGCGCG CTACAGCGTG CGCGCCTTCG TGCGCCTCAA GCCCGAAGGC
CACTGCCCCG CGCGCACCGT CTGGAGCGAG CCGAGCGAGC CCTTCGTGAT CGCGCCCTGG
TACGAAGGCG GCGGCGCGCC GCCGGTGCAG ATCCGCCTGC CCGATGCGTC CGACCGCACG
CTGCTCAAGG CGCTCAAACC CAACGTCGCC TTCATCGTGC CGCCGTCGAT GCAGAACCTG
CTGTCGGGCA AGGCGAAGGA TCTGCTCGAA GGCAAGGGCA GCGTCGGCAC GGCGGGCTTG
AGCTGGATCT GCAGCTTCAA CATACCGGTG ATCACGATCT GCGCCTTCAT CGTGCTGAAC
ATCTTCCTGA CGCTGTTCAA CTTGGTGTTC GGCTGGCTGT TCTTCATCAA GATCTGCATC
CCGTTCCCGA AGCTGGGCAA CAAGCCACCC GGAGGGTCTT CACCATGA
 
Protein sequence
MSGELRSAHH DVVIGAPRDL VATRRPALLQ RSDDDFIGAT LEALRAPAGR AALRAKLARA 
TDAQGRLKLF QPIQRQFHVA LIEAWCDTPG APRIDPKRVD AAGLVVRRLG PQGAEGWMHA
NGQVRGWVPL ARIGGETAEP QAAARLTARL TGVAQIDRQL SLHAREDADH PLEEQISGLY
LAPPDVNADA GKTLFYGLVP TTSSELSDVP PAPTGDDGFG AGSDAFRNHL VEALRGDAMD
LPFPGDLLVP GWFEASESPG VRPPDGVTQN QFNLLQQDVR TAAGATALRM QRFLLFLRQI
AGEFNAFAGG NEVKTLKTIL ASIQLPLVLR PDEKVVRFVR ADQFLAKASA ILLEQTASAS
VVEMPAYWPA LAPADATRLK NALHQALQAR LAAMQTQAGR YDEPTARYSV RAFVRLKPEG
HCPARTVWSE PSEPFVIAPW YEGGGAPPVQ IRLPDASDRT LLKALKPNVA FIVPPSMQNL
LSGKAKDLLE GKGSVGTAGL SWICSFNIPV ITICAFIVLN IFLTLFNLVF GWLFFIKICI
PFPKLGNKPP GGSSP