Gene Plut_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_2035 
Symbol 
ID3746082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2265697 
End bp2266887 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content57% 
IMG OID637770066 
Producthypothetical protein 
Protein accessionYP_375920 
Protein GI78187877 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000607972 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.528651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG GCGGCAAGGA GTATGTTTTT CACAGCGTGG AAACCTTTAG TACATTAAGA 
AAACAAGAGA CATTGATTTT TTTCAAGGCA CCCAACAATC ACGCACCCAT GACACGACCG
GGCCGACTGC TCATCCTCAC CGCAAGCATC TTGCTGCTTC AGGATGCCCC GCTTCCGGGC
CTGCACCATA ATACGGCTCA TGCGGAGAAA AAGAAAATCA TTCTCCGCCA CGCCGACACG
ATCGAAGGCG GCGAGGATGC CGGCGGCAGC TTCCGCGCGG CCATCGGCAA CGTGTTGTTC
GAGAAGGACA ACCTGACATT GAGTTGCGAC CGCACCACCG ACTATGAGGG ACAGGACCGC
ATTGTGATGA ACGGCCATGT CCGCATCTCC GATGGCGCCA AAGAGATCTT CGGCGACGAT
GGGGTCTTCC ACCCATCCAC CGACGTCTGC GAACTCCATG GCAACGTCCG CGGCCGGATG
CTGGACAACT CCATGATGGT CCGGTCCAGA AAAGCAGTCT TCAACAACCA GGAAAACCGC
CTCTGGTTCT ACGACAACGC CATCGGGTGG CGTTACAACG AACAGGTCAG CGGAGACATC
CTGCGGATCC AATTCAAGCC TGTGGATGAG AATGCACCGT CGAAAGATGG TGGCGGCAAA
CAGAAAATCG ACGAAATGCA GGTGCACGGC CACGCATTCT ACGCCACACG CGACACGCTG
ACGGCCGACC CGGAAGGCTA TGACCAGCTC AGCGGGCGCC ATATCGTGGT GCTCATCGGA
GATGACTCGA AAGTAAAAGG GGTCACGGTG ACCAAAGAAG CCGAAAGCCT AGTCCATATT
TATGACAACG ACGGCATGGA GAAGCCGGAC GGCAGCGGGA AAGGCAACAG CAGTGACAAG
GGCAAGGGCT CGGACGATGG CGCGAAAAAG CTGAGCGGCA TCAACTACTC GAGCGGCGAC
AGAATCAAGA TGATCTTCAA GGAAGGCACT CTGAAGAAAA TGAATGTCAC CGGAAATGCC
GAGGGAACTG AATACCCTCC CGAGCTGCGC GGATCGGTCA ACCTCCCGAA ATTCAAATGG
CGGGAAGACG AGCGACCATT CGGAAAACAG GCCGCCGGGG ATAGAACGGC CGCAGGCATC
AAGACCGTGG ACAAGGGTGC CGAGGCGCCT GAAAGAGAAA GCCTTCCGTA A
 
Protein sequence
MAAGGKEYVF HSVETFSTLR KQETLIFFKA PNNHAPMTRP GRLLILTASI LLLQDAPLPG 
LHHNTAHAEK KKIILRHADT IEGGEDAGGS FRAAIGNVLF EKDNLTLSCD RTTDYEGQDR
IVMNGHVRIS DGAKEIFGDD GVFHPSTDVC ELHGNVRGRM LDNSMMVRSR KAVFNNQENR
LWFYDNAIGW RYNEQVSGDI LRIQFKPVDE NAPSKDGGGK QKIDEMQVHG HAFYATRDTL
TADPEGYDQL SGRHIVVLIG DDSKVKGVTV TKEAESLVHI YDNDGMEKPD GSGKGNSSDK
GKGSDDGAKK LSGINYSSGD RIKMIFKEGT LKKMNVTGNA EGTEYPPELR GSVNLPKFKW
REDERPFGKQ AAGDRTAAGI KTVDKGAEAP ERESLP