Gene Cag_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1604 
Symbol 
ID3746469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2094445 
End bp2095620 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content39% 
IMG OID637774144 
Producthypothetical protein 
Protein accessionYP_379902 
Protein GI78189564 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.562385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTTC GTAAAAACAT AGGCGAATCA TACACGCCAA CATATTTTCT TGCATCGCTC 
GGTAACGGCG GCTTAGCGGT AACTTTTTTC ATGTTTCTTA TGTTCATGAT TCCGCATAAA
GGTCGCCCTA TGCCCGTTTT TGAAGATATT GTTGCAGCAC TACAAAGCAC ACTCCCTATA
CAGTTTTTAA CTATTGTAAG TCTTGTTGGA ATTATATGGT TTTCCGCACA ACATTATCGT
ATGCTTATTT GGAACATTCG CCAATATCTT GCGTTTAAGC ATACCCCTGC ATTTAATCGC
TTTCAAACAA CGGATGCACA AGTACAATTA ATGGCTATAC CGTTAACCTA CGCTATGGCA
ATTAATGTCA TGTTTATTCT TGGTGCCGTG TTTGTTCCTC AACTTTGGAG TGTGGTAGAA
TACCTCTTTC CAATGGCAAT GGGAGCCTTT TTTATTGTTG GTATTTACTC TATTTCTATT
TTTTACACAT TCTTTTCACG AGTTATTGCG CACGGAGGCT TTGACTGCGA AAAAAACAAT
AGCTTAAGCC AAATGCTTTC CATCTTTACC TTTTCAATGG TTGCTGTCGG CTTTGCCGCA
CCAGGTGCTA TGAGCCACAA CGTGATTGTT TCAGGCGTAG GCATTATAAT GGCAACATTT
TTTCTTGCAC TTGTAACAAC GCTTGGTGTT ATTAAAATTG TGCTTGGCTT TCGCTCTATG
TTAGCTCACG GCATTAACTA TGAAGCTTCA GTTTCACTAT GGATTGTTAT TCCAATTCTT
ACTCTTGTTG GAATTACTAT ATATCGTATT GCTATGGGAT TAGTGCATAA CTTTGATGCC
GTTATCCATC CATGGGCGCA CGTTATTATG TTTACCGCCT TGTGTGGTAT TCAAATCTTT
TTTGGGTTGT TGGGGTATGG CGTTATGAAA GAGCTTGGCT ACTTTAACGA ATTCATTCAT
GGGGAAAGCA AAAGCGCTGT TTCATTTGCA GCAATTTGCC CGGGTGTAGC ATTTGTTGTG
CTTGGCAACT TTTTTATTAA CAGAGGTCTT GTAGCCGCTG GACTTATTGA AATGTTTTCA
GTTGCTTACT TTGTGCTCTA TATACCATTG CTTGCAATTC AAGCACAAAC CATTATTGTG
TTAATGCGTC TTACGCGTAA GCTTTTAAAA GCGTAA
 
Protein sequence
MALRKNIGES YTPTYFLASL GNGGLAVTFF MFLMFMIPHK GRPMPVFEDI VAALQSTLPI 
QFLTIVSLVG IIWFSAQHYR MLIWNIRQYL AFKHTPAFNR FQTTDAQVQL MAIPLTYAMA
INVMFILGAV FVPQLWSVVE YLFPMAMGAF FIVGIYSISI FYTFFSRVIA HGGFDCEKNN
SLSQMLSIFT FSMVAVGFAA PGAMSHNVIV SGVGIIMATF FLALVTTLGV IKIVLGFRSM
LAHGINYEAS VSLWIVIPIL TLVGITIYRI AMGLVHNFDA VIHPWAHVIM FTALCGIQIF
FGLLGYGVMK ELGYFNEFIH GESKSAVSFA AICPGVAFVV LGNFFINRGL VAAGLIEMFS
VAYFVLYIPL LAIQAQTIIV LMRLTRKLLK A