Gene Cagg_3801 details


Gene Information

Locus tag: Cagg_3801
Symbol:
ID: 7267875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4637015
End bp: 4638109
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 59%
IMG OID: 643568609
Product: chorismate synthase
Protein accession: YP_002465073
Protein GI: 219850640
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
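
The length fields in this record are consistent with its coordinates. A minimal sketch of the arithmetic follows, assuming Start bp and End bp are 1-based inclusive positions (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4637015
end_bp = 4638109

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1095
print(protein_length)  # 364
```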


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.209902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.33378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGAA ATAGCTTTGG TCACGTCTTT CGGCTGACAA CGTGGGGTGA ATCGCATGGC 
CCGGCAGTGG GGTGTACCGT AGATGGTTGC CCGGCCGGGT TGCCGCTCGA TGTGGCCGAT
ATTCAACGCG AACTCGACCG GCGGCGGGTT GGTCAAAGCC GGGTCAGTTC GCAACGGCGC
GAAGCTGATG AGGTACAGAT ACTCTCCGGT GTGTTTGAGG GTCGCACCAC CGGAACGCCG
ATAACGATGG TTGTTTACAA TACCGATGCC AAATCTCACC ACTACGATAC TATCAAAGAC
GCCTACCGTC CCGGTCACGC CGATTATACG TGGGACGTAA AATACGGTTT TCGGGATTGG
CGTGGTGGTG GGCGTTCGTC AGCCCGCGAG ACGATTGGGC GGGTAGCCGG TGGTGCAATT
GCGCGCAAAC TGTTGGCGAC GGTGGGGGTA ACAATTGTAG GGTATACCCT CCAACTAGCC
GATTTGCGCG CCGAGGTCTT TGATGAAGCA GAGATCGAAC GCAACATCAT GCGGTGCCCT
GATGCGCGGG TGGCGGCGTT GATGGTTGAA CGTGTCGATC AGGCGCGTCG CGAACTCGAT
TCGCTGGGTG GGATCGTTGA AGTCCGGGCG CGAGGTGTAC CTCCCGGCCT CGGTGAGCCG
GTGTTTGATA AGCTCCAAGC CGATATCGGT AAGGCCATGT TCTCGATTCC GGCTATCAAA
GGAGTGGAGA TTGGTGAAGG GTTTGGGGTG GCAATGCTGC GTGGCTCGCA GAACAACGAT
CCCTTCATCC GGCGCGAGGA TGGTTCAATC GGTACGACCT CGAACCATCA CGGCGGTATT
CTCGGCGGCA TTTCAACCGG CGAAGAGATC GTGGTACGAT TGGCAGCCAA ACCACCGGCC
AGTATTGCCC GCCCACAACA AACGGTCGAC CGCGACGGTA ACCCGGTAAC GATTGAGGTG
CATGGTCGCC ATGACCCAAC GGTCTTGCCG CGTCTCGTGC CGGTGGCCGA AGCTATGCTG
GCGTTGGTGC TGGCCGATCA TCTGTTGCGA CAGCGGCTTG CTCGGGTGTC GTGGTCGGAG
CGTGATGATG GGTAA
 
Protein sequence
MPGNSFGHVF RLTTWGESHG PAVGCTVDGC PAGLPLDVAD IQRELDRRRV GQSRVSSQRR 
EADEVQILSG VFEGRTTGTP ITMVVYNTDA KSHHYDTIKD AYRPGHADYT WDVKYGFRDW
RGGGRSSARE TIGRVAGGAI ARKLLATVGV TIVGYTLQLA DLRAEVFDEA EIERNIMRCP
DARVAALMVE RVDQARRELD SLGGIVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK
GVEIGEGFGV AMLRGSQNND PFIRREDGSI GTTSNHHGGI LGGISTGEEI VVRLAAKPPA
SIARPQQTVD RDGNPVTIEV HGRHDPTVLP RLVPVAEAML ALVLADHLLR QRLARVSWSE
RDDG
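
The gene and protein sequences can be checked against each other. The sketch below strips the 10-character block spacing, computes a GC fraction, and translates codons using the amino acid assignments of translation table 11 (which match the standard genetic code). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used here rather than the full record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCGGAA ATAGCTTTGG TCACGTCTTT CGGCTGACAA CGTGGGGTGA ATCGCATGGC"
last_line = "CGTGATGATG GGTAA"

print(translate(first_line))  # MPGNSFGHVFRLTTWGESHG
print(translate(last_line))   # RDDG*
print(round(gc_fraction(first_line), 2))
```

The two translated fragments match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field.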