Gene Cag_0437 details

Gene Information

Locus tag: Cag_0437
Symbol:
ID: 3748143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 512251
End bp: 513582
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 49%
IMG OID: 637772970
Product: hypothetical protein
Protein accession: YP_378753
Protein GI: 78188415
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:
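
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (a common convention that this record does not state explicitly) and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Coordinates copied from the record above.
start_bp = 512251
end_bp = 513582

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.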


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATTC TCGACCGATA TATTCTTAAA AAACAGATAG CGCCATTTTT CTTTGCCTTT 
ATAACCATTG TTGCACTATT GCAGCTCCAA TTTTTTTCAA CGTTTGCGGA GCGTTTTATT
GGTAAGGGCA TTACGTTTGT GGCAATTGTG GAATTGCTGG CGCTGCAATC GGCATGGATG
GTGAGCTTTG CCTTGCCCAT GGCGGTGCTT GTGGCTGTGG TAATGTCATT TGGAACGCTT
ACAACCACCT CTGAAATGAC GGTGTGCCGT GCTTCGGGTA TTTCGCTCTA TCGGGTAATG
GTGCCCGTTA TTGTGGTAAG CTTATTGCTT TCGTTCACGG TGGAGCGTTT TAATAATGTG
CTCTTACCGC AAGCAAATTA TCAAGCAAAG TCGCTCATGG CGGAAATTGC ACGCTCCAAA
CCTGCCTTTG GATTAACGGA GCAGGCATTT TCAACCTTGG TTGATGGCTA TTCAATGTAT
GTGCGTTCAA GCGATGAGCG GCATGGTGAG TTGCGTGGCG TGGTAATTCA CGATATGACC
AGACCTGAAT ATCGTACCAC CATTACCGCT ACGCGTGGAC GCGTGGAGTT TACCCCCGAT
TACCAATACC TTGTAATGAC GTTGCGTAAC GGTGCCATTC ATCAGTTGCA GCAGCCTGAG
AAAAGCGGCT ATCGTAGCAT GAATTTTGAA CGCTATCGCT TTGTATTTGA ATCGTCGCTT
TCGGGCTTTA CCCCCTCATC GGGTAACCGT ATGCGCGCTG ATGCCAATGA GTTATCGGCT
GGGGAGTTGC ACGCTATTGG ACTGGAATTT CGCCGCCGCG AAGCGGTTGC TTTGCTGCAT
GTGCAAGCGC CTCTTGTGGC GTTAGAAAGG CTTGCTGCCA ACACAGACAA CTCCAAAATG
GCAGCCTCGC CACCAACTCT TCGGCAGGAG ACCTCCGCAA TTGCAGCAAC AAAAGCCTTA
GCGGTTATTG AAGGCGAAAT AGCACGAGTT GCAAGCGAGT TGGAGGTTGC TTCCACCAAT
CGAACGCTCT ATAACCGTTA TATGGCGGCG TACCATAAAA AATATTCGCT GTCGTTAGCG
TGCGTGGTTT TTGTGCTGGT TGGGGCACCA CTTGGGGTGT TGGCTCGGCG TGGTGGCTTT
GGTGTTGGTG CTGCCATATC GCTCCTCTTT TTTGTGCTTT ACTGGATGTT GATGATTAGC
GGAGAGAAAA TGGCGGAACG AGGTGTGCTT GATCCCATGA TAGCCATGTG GATGGCTGAT
GGAGTAATGG CGCTCATTGG TGTAGGATTA GTAACAAAAT TAACGCAAGC CCTCTTTTCT
ACCTCACGGT AG
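
GC content is the percentage of G and C bases in the gene sequence. The sketch below shows one way to strip the 10-base block formatting used above and compute that percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
sample = clean_sequence("ATGACCATTC TCGACCGATA")
sample_gc = gc_percent(sample)
print(len(sample), sample_gc)
```

To check the GC content field of the record, the full gene sequence would be passed through `clean_sequence` first.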
 
Protein sequence
MTILDRYILK KQIAPFFFAF ITIVALLQLQ FFSTFAERFI GKGITFVAIV ELLALQSAWM 
VSFALPMAVL VAVVMSFGTL TTTSEMTVCR ASGISLYRVM VPVIVVSLLL SFTVERFNNV
LLPQANYQAK SLMAEIARSK PAFGLTEQAF STLVDGYSMY VRSSDERHGE LRGVVIHDMT
RPEYRTTITA TRGRVEFTPD YQYLVMTLRN GAIHQLQQPE KSGYRSMNFE RYRFVFESSL
SGFTPSSGNR MRADANELSA GELHAIGLEF RRREAVALLH VQAPLVALER LAANTDNSKM
AASPPTLRQE TSAIAATKAL AVIEGEIARV ASELEVASTN RTLYNRYMAA YHKKYSLSLA
CVVFVLVGAP LGVLARRGGF GVGAAISLLF FVLYWMLMIS GEKMAERGVL DPMIAMWMAD
GVMALIGVGL VTKLTQALFS TSR
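
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch, assuming the compact NCBI-style codon layout with bases ordered T, C, A, G. Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, which this sketch does not model. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence shown above.
first_residues = translate("ATGACCATTC TCGACCGATA TATTCTTAAA")
last_residues = translate("ACCTCACGGT AG")
print(first_residues, last_residues)
```

The two fragments are in frame because each full line of the gene sequence holds 60 bases, a multiple of three.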