Gene Clim_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1522 
Symbol 
ID6355779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1643794 
End bp1645074 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content49% 
IMG OID642669128 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001943551 
Protein GI189347022 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.101861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGCAGA AGCTCAAAGT GTTTTCATGG CTGCTGTTTG ATTTTGCCAA TACGGCATTC 
AGTGTCATGA TGGTGACCTT TGCTTTTCCT CTTTATTTCA AGAATGTGAT CTGTGGCGGG
GCTCCCTCAG GCGATGCAAT GTGGGGGATA AGTGTCAGCG TCTCGATGTT GTTTGTCGCG
GTGATCTCTC CGGTACTCGG TGCCGCTTCG GATTATTCCG GCAGGCGCAA ACGGTACCTT
TTTTTCTTTA CCCTGCTTTC TGTAGTTGCA ACGGCGTTGC TCGGCTTTTC GGCACCGGGT
ATGGCTATTG CCGCGGCTCT GCTTTTTATA CTCGCAAACA TGGGATTTGA GGGAGGGCTG
GTTTTTTATG ATGCATATCT CAAGGAGATA GCTTCGGATA AAAGTATCGG CAGGGTATCC
GGTTACGGTT TTGCGATGGG ATATCTTGGC TCGCTCACCA TTCTGCTGCT TATGATGCCC
CTGCTCAGTG GCGGTATTGT GCCGCAAAAC GCGTCCAGTA TCCGTACTGC TTTTATGGTG
ACAGCGTTAT TTTTTGCGAT ATTTTCACTT CCCCTTTTTG TTGTGCTTCG TGATGAAAAG
AAGCGCGATG TCCGCGCTCT TTCCATGGGA TTGATCGTAC GTTCCATAAA AGAGGTGAAG
CATACGGTTG GCCACATCAT GCATTATCCT GATCTTGCCC GCTTTCTTCT CGCCTATTTT
TTCTATAACG ACGCCATTCT CACCATTATC GCGTTTTCAT CGATTTATGC CCAGAATACG
CTTGCATTCA CAACCAGGGA ACTGATAATC TTTTTTATGC TGGTGCAGAC TACAGCTATT
GTCGGGTCGG TTGTATTCGG GTTTATTACC GATTGGATAG GTCCGAAAAG AACCATTGTC
TTTACCCTCA TGATCTGGTT TGGCGTGGTT CTCGCTGCGG TATTTGCCGA CAGCAAAGTG
CTGTTTTTCG CAACCGGCAT GCTGGCCGGT ATGGCTATGG GGTCTTCGCA GGCAGCTTCC
CGATCAATGA TGGCAAAACT GACTCCCCGT GAACATGTTG CCGAGTTTTT CGGTTTTTAT
GACGGGACCT TCGGAAAGGC TTCGGCGATA GTCGGCCCTC TTGTATTCGG TATGGTTTCG
GCGCAGGCAG ACAGTCAGAA AGCAGCGCTC TCTTCACTGC TTGTTTTCTT TGTGATCGGT
CTTGTTCTGA TGCTGCGGGT CAGGTCGCAG GGTATGACGG TGAGTGACCA GCACTCCATA
ACGGGGGCAA CGCGTTTATA G
 
Protein sequence
MTQKLKVFSW LLFDFANTAF SVMMVTFAFP LYFKNVICGG APSGDAMWGI SVSVSMLFVA 
VISPVLGAAS DYSGRRKRYL FFFTLLSVVA TALLGFSAPG MAIAAALLFI LANMGFEGGL
VFYDAYLKEI ASDKSIGRVS GYGFAMGYLG SLTILLLMMP LLSGGIVPQN ASSIRTAFMV
TALFFAIFSL PLFVVLRDEK KRDVRALSMG LIVRSIKEVK HTVGHIMHYP DLARFLLAYF
FYNDAILTII AFSSIYAQNT LAFTTRELII FFMLVQTTAI VGSVVFGFIT DWIGPKRTIV
FTLMIWFGVV LAAVFADSKV LFFATGMLAG MAMGSSQAAS RSMMAKLTPR EHVAEFFGFY
DGTFGKASAI VGPLVFGMVS AQADSQKAAL SSLLVFFVIG LVLMLRVRSQ GMTVSDQHSI
TGATRL