Gene Cag_0403 details


Gene Information

Locus tag: Cag_0403
Symbol:
ID: 3747781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 468043
End bp: 469044
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 42%
IMG OID: 637772931
Product: hypothetical protein
Protein accession: YP_378719
Protein GI: 78188381
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
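
The length fields above follow from the coordinates. A minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(468043, 469044))  # 1002
print(protein_length(1002))         # 333
```

Both results agree with the Gene Length and Protein Length fields of this record.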


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAAGA GCGAGCAGAG GTATGGAAAA CAACTTATTT CGGTTCTATC AGCACAGCTA 
ACACAGAAAT ATGGCTCAGG TTTTTCTGTT ACCAATCTTA AATACTTTAG AACCTTTTAT
GTAACCTATC CCGATCGTTT TGACACAATT GGCTACCCAA TGGGTAGCCA ATTACCCCAA
GAAGAAAAAA GTCGCCCATT GGGCGACCAA TTGCCCGAAG CAGAAAAAAG TTACCCAATT
GGTAGCGGAT TTTCACCACA ATTAACATGG TCGCATTATC GCGCTTTGAT GCGTGTGCAA
AACGAAAAAG CTCGTGAGTT TTATGAAAAC GAAGCTATTG ACGGAGGATG GGATAAACGC
ACGTTAGAGC GGCAAATTCA CACGCAATAC TACGAACGTT GCCTTCATAG CCAGCAACCG
GAGAAAATTA TTGCTGAAGG GAGAAAACTA CAAAAAGAGG TGCCGTTAGC AACGGACATC
CTGAAAAATC CTTACGTATT GGAATTTCTT GGCTATCCGA ACTTTGCAGA ATTACGGGAA
TCGGATGTTG AACGGGCTAT CATTACGCAT CTCCAACGGT TTCTGTTAGA ACTTGGTAAC
GGTTTTGCCT TTGTGGCGCG TCAAAAGCAT ATTCGTATTG ATGAGGATGA TCGTTTCATT
GACTTGGTGT TCTATCACTG CCGCTTGAAG TTTTATTTAC TCATAGACTT GAAATTAGGA
AAGCTAACTC ATGCAGATGT TGGGCAAATG GATGGCTACG TTCGCATGTT TGATGGGCTT
TTTACGGCAT TGGACGATAA CCCCACAATA GGGCTCATTT TATGCACTGA AAAATGTGAT
ACCGTTGCTC GTTACTCTGT ACTTAACGAT AGAAAACAGA TTTTTGCATC AAAATACCTA
CCAAGCCTGC CCAGCGAAGA ACAATTACAA ATCGAAATTG AAAAAGAACG ACGGCTTATT
GAAGCTGCGT TGGAAGAACA AAAAGCCTGT AAGCATGAGT AA
 
Protein sequence
MGKSEQRYGK QLISVLSAQL TQKYGSGFSV TNLKYFRTFY VTYPDRFDTI GYPMGSQLPQ 
EEKSRPLGDQ LPEAEKSYPI GSGFSPQLTW SHYRALMRVQ NEKAREFYEN EAIDGGWDKR
TLERQIHTQY YERCLHSQQP EKIIAEGRKL QKEVPLATDI LKNPYVLEFL GYPNFAELRE
SDVERAIITH LQRFLLELGN GFAFVARQKH IRIDEDDRFI DLVFYHCRLK FYLLIDLKLG
KLTHADVGQM DGYVRMFDGL FTALDDNPTI GLILCTEKCD TVARYSVLND RKQIFASKYL
PSLPSEEQLQ IEIEKERRLI EAALEEQKAC KHE
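
The protein sequence can be re-derived from the gene sequence. A minimal sketch, assuming the sense-codon assignments of translation table 11 (which match the standard genetic code; the tables differ in their permitted start codons, which does not matter here because this gene begins with ATG). The `translate` helper is hypothetical, ignores the whitespace used to group the sequence into blocks of ten, and stops at the first stop codon:

```python
from itertools import product

# Compact genetic code layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGGAAGA GCGAGCAGAG GTATGGAAAA"))  # MGKSEQRYGK
```

Passing the full gene sequence to `translate` should give a protein that can be compared against the listed one after removing the spaces from both.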