Gene Cpha266_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1849 
Symbol 
ID4571191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2142687 
End bp2144033 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content48% 
IMG OID639766431 
ProductPUCC protein 
Protein accessionYP_912289 
Protein GI119357645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAC TTAACCTGAT CCGCCTCTCC CTTTTCCAGA TGGGTTTTGG AATCATGCTC 
GGTTTTCTGC ATGATACCCT GAACCGGGTC ATGACTACGG ATCTTGGCAT CTCCTCAACC
ATTGTGTTTG GCCTCATCAG CCTGAAGGAG CTGCTTGCGA TATTCGGCGT CAAGGTCTGG
GCTGGCAACA TGTCCGATCG CGCGAATCTT TTCGGTCTGA AACGCACACC CTATATTCTG
CTTGGGCTTT TTTTCTGTGT TTTTTCCTTT ATGCTCTCTC CTGCGGCAGC CTATGAGGTA
ACTGTCGCCG GAAAAAGTTT TTCTGAACTT TTTCCGGCCA TATTTACCGA TATCGGTCTG
TTGAAGCTTG CGGTCATTTT TCTTCTGTTT GGTTTTGGAT TGCAGGTTGC CACAACAGCC
TACTATGCGC TTCTTGCCGA TACGGTTGGT GAAGAGAACA TTGGCAAGGT TACCGGTGCA
AGCTGGACTC TCATGGTTCT TACTACCATT ATTGCTACAA GGGTTGTCGG CTCGTTTCTC
GATGTCTATA CCCCCGAAAG GCTTATTACT GTTGCTGAAG TTGGTGGATC GATAGCGCTC
TGTATCGGGC TTTTTGCCGT ACTCGGTATT GAAAAGCGAA ATGTAGTTCC TTCAGAGGGC
AAGAGCAGGC ACTCCATCTC TTTTTCACAG TCACTGAAAC TGCTCTCTTC ATCACCGAAA
ACCCTGCTGT TTGCTTTTTA TATCTTTATC TCGATTTTTG CGCTCTTTGC CAATGAAATT
GTCATGGACC CTTTTGGAGG CGATGTATTC GGCATGCCGG TCGGTACAAC TACCAAGCTG
TTCCGGCCGA CAATGGGTGG TACGCAGTTG ATTTTCATGC TGATCGTGGG ATTTCTGCTC
AACAGGATCG GTCAGAAGCG AGGCGCGCAT ATCGGCAATT TTTTTGGTAT TATCGGCTTC
AGCATGCTGA TTGCCGCCGG CTTCATGCGC GATGAACAGT TCCTTCGCAT TGCGCTTGTC
GTAACCGGCA TAGGGCTTGG AGCGGCCAGC GTATCCAATA TCTCCATGAT GATGACCATG
ACGGCAGGTC GCAGCGGTAT CTATATAGGC CTCTGGGGTA CAGCGCAAAG CCTCGCTATT
TTTATCGGGC ATTTCGGAGC GGGTATTATT CGTGACGTGG TTTATCACCT TTCCGGAGCT
TATGTCTGGG CTTATGCCGC TATATTTTTA ATGGAAATTA TTGCCTTTAC GATATCGAGC
CTTGTTCTGC CCCATATTTC GAAAGAGGCG TTCGAAGCCG AAAGCAAAGC GAAAATCGCT
GAACTGCAAC CAGCAGAAGG GGGTTGA
 
Protein sequence
MKQLNLIRLS LFQMGFGIML GFLHDTLNRV MTTDLGISST IVFGLISLKE LLAIFGVKVW 
AGNMSDRANL FGLKRTPYIL LGLFFCVFSF MLSPAAAYEV TVAGKSFSEL FPAIFTDIGL
LKLAVIFLLF GFGLQVATTA YYALLADTVG EENIGKVTGA SWTLMVLTTI IATRVVGSFL
DVYTPERLIT VAEVGGSIAL CIGLFAVLGI EKRNVVPSEG KSRHSISFSQ SLKLLSSSPK
TLLFAFYIFI SIFALFANEI VMDPFGGDVF GMPVGTTTKL FRPTMGGTQL IFMLIVGFLL
NRIGQKRGAH IGNFFGIIGF SMLIAAGFMR DEQFLRIALV VTGIGLGAAS VSNISMMMTM
TAGRSGIYIG LWGTAQSLAI FIGHFGAGII RDVVYHLSGA YVWAYAAIFL MEIIAFTISS
LVLPHISKEA FEAESKAKIA ELQPAEGG