Gene Cagg_3117 details

Gene Information

Locus tag: Cagg_3117
Symbol:
ID: 7269535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3776742
End bp: 3777737
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 56%
IMG OID: 643567938
Product: NMT1/THI5 like domain protein
Protein accession: YP_002464411
Protein GI: 219849978
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
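
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3776742
end_bp = 3777737

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (996 bp) and Protein Length (331 aa).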


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.3456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGAT GGTTTTTGCT GACCCTCGTC ATCCTGCTGA CCGCCTGTGG CGGTGGTGCC 
GCTACGCCGA CAACGCCGAC CCAAGCTCCT CTCACCAAAG TGCGGGTGGG GCTGGATTGG
ACGCCGAATA CCAACCATAC CGGCTTGTAC GTCGCTCAGG CGAAAGGCTA CTATGCCCAG
CAAGGCCTAG AGGTTGAGAT CCTCGGCGCA CAAGAGGGGG GGACTGTCGA GCAGTTGGTG
GCAACGGGCC GCCTCGATTT TGGTATTTCG CATCAGGAAG GTGTAACTCA GGCTCGGGTC
GAGGGCGTAC CGATTGTCTC GATTGCTGCG ATTATTCAGC ACAATACGAG TGGTTTTGCC
AGCCGTGCTG AAGAGGGCAT CACCAGCCCA CGCGATTTTA TCGGTAAAAA ATACGGTGCA
TTCGGATCGC CTGTCGAACA AGCAGTTATT AAGGGTTTGC TCGAATGCGC TGGAGTTGGC
GATCAATTTG ATCAGGTGCA GTTTGTTGAT ATTGGTAGTT CCGATTTCTT CGTCGCCACC
GAGCGTGATG AAGTAGATTT TGTCTGGATC TTCAAGGGTT GGACGGGAAT CGAGGCTGAG
GTGCGGGGCG TGCCACTTAA CATTGTGATG ATGAATGATC TCCAGTGCAT TCCCGATTAC
TACACGCCCG TGCTCATCAC CGGTGAGAAG CTAATTGCCG AACAGCCCGA TCTCGTGCGA
CGCTTCCTCG CTGCCACGAG TGCCGGGTAT CGCTTTGCCA TTGAGCAACC GGGCGAAGCA
GCCGATATTT TGCTCAAAGC TGCGCCCGAA CTCGATGCCG AACTTGTCCG GCGTAGTCAG
CAATATCTGG CCGGTCAGTA TCAGGCTGAG GCGGCACGCT GGGGCGAGCA GAAGCTTGAA
GTCTGGCGTG CCTACGCACA GTGGATGGCC GATCGTAACC TGATCGCTCG CATGATCGAG
CCGGAAAAGG CGTTTACCAA CGATTTCTTG CCGTAG
 
Protein sequence
MRRWFLLTLV ILLTACGGGA ATPTTPTQAP LTKVRVGLDW TPNTNHTGLY VAQAKGYYAQ 
QGLEVEILGA QEGGTVEQLV ATGRLDFGIS HQEGVTQARV EGVPIVSIAA IIQHNTSGFA
SRAEEGITSP RDFIGKKYGA FGSPVEQAVI KGLLECAGVG DQFDQVQFVD IGSSDFFVAT
ERDEVDFVWI FKGWTGIEAE VRGVPLNIVM MNDLQCIPDY YTPVLITGEK LIAEQPDLVR
RFLAATSAGY RFAIEQPGEA ADILLKAAPE LDAELVRRSQ QYLAGQYQAE AARWGEQKLE
VWRAYAQWMA DRNLIARMIE PEKAFTNDFL P
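
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in its coding orientation (the Strand field in this record is empty) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `translate` and `gc_percent` are hypothetical and not part of any database tool.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGCAGAT GGTTTTTGCT GACCCTCGTC"
print(translate(first_blocks))  # MRRWFLLTLV
```

The first ten residues produced from the opening blocks match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.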