Gene Cag_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0203 
Symbol 
ID3746690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp231802 
End bp232890 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content45% 
IMG OID637772730 
Producthypothetical protein 
Protein accessionYP_378524 
Protein GI78188186 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACA ACCACAGCAC CAATCAGCGC ATTTTACTTT TGCTGCTGCT TGTCATCTCC 
GCCCTCTTTT TTACCATGAT CCGCTACTTT TTGTTAGTAG TGGTGCTTGC GGCAATTTTT
TCAGCCCTTG CCATGCCCGT CTATAACCGC TTTGAACGAG GGTTGCGTGG CAAACGTAGC
TTAAGTGCCA TTATGACCCT ACTCACCTTG CTGTTTATTG TTGTGCTGCC GCTTGCAATA
CTGCTTGGCT TAGTGGTAAA ACAAGCTATT CGGTTAAGCA ACGTTGCTGT ACCATTTGTG
CAAGAACAGC TTTTAACGCC CTCGCAATTC GATCACCATT TGCAATCGCT CTTCTTTTAT
CCCGAATTAG TGCTTTACCG TGAAGAGATT TTGCAAAAAG TAAGCGAGCT TGCCACCAAA
TTTGGAACAC TCCTCTTTAA TGCCATTTCA AGCTTTACCT ATTCAGCGGT TACCGAAATA
GTTCTGTTTT TTGTTTTTCT CTACACCATG TTTTTTTTCT TGCGAGACGG CAAGCAAATG
CTGCAATCAA TGTTAGCGCT GCTTCCCCTC TCACACACCG ACCAATATCG CCTCCTCGAT
AAATTTCTCT CAGTTACACG CGCTACCCTT AAAGGCAGCT TAGTAGTGGG AATGGTGCAA
GGTTCGCTTG CAGGTATGGC GCTTTACATG GCAGGTATAG AAAGTGCTCT TTTTTGGGGA
ACCGTTATGA GCTTTCTCTC TCTTATTCCC GTGCTTGGCT CAGCCCTTGT GTGGATACCA
GCCGTCATTT ACCTTGCAAC CATTGGTAGC TACCCACAAG CTCTTGGAGT TTTGCTCTTT
TGCATGATTG TGGTTGGGCA AATTGACAAC ATTATTCGCC CCATTCTTGT TGGGCGCGAT
ACCCAAATGC ACGAATTGCT CATTTTTTTT GGTACCCTTG GCGGCATAGG CATGTTTGGC
TTTTTTGGTG TTATTCTTGG ACCTATTGTG GCAGCCCTTT TTACCACCAT TTGGGAAATG
TATGCCGAAA GCTTTGGCGA CTACCTCTCC ACCATCCAAA AGAACCGCAC TTCTACACTC
AAAGACTAA
 
Protein sequence
MSNNHSTNQR ILLLLLLVIS ALFFTMIRYF LLVVVLAAIF SALAMPVYNR FERGLRGKRS 
LSAIMTLLTL LFIVVLPLAI LLGLVVKQAI RLSNVAVPFV QEQLLTPSQF DHHLQSLFFY
PELVLYREEI LQKVSELATK FGTLLFNAIS SFTYSAVTEI VLFFVFLYTM FFFLRDGKQM
LQSMLALLPL SHTDQYRLLD KFLSVTRATL KGSLVVGMVQ GSLAGMALYM AGIESALFWG
TVMSFLSLIP VLGSALVWIP AVIYLATIGS YPQALGVLLF CMIVVGQIDN IIRPILVGRD
TQMHELLIFF GTLGGIGMFG FFGVILGPIV AALFTTIWEM YAESFGDYLS TIQKNRTSTL
KD