Gene Cagg_1309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1309 
Symbol 
ID7268600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1609169 
End bp1610296 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content58% 
IMG OID643566152 
ProductNMT1/THI5-ike domain protein 
Protein accessionYP_002462653 
Protein GI219848220 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000217604 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGGCC GTTTGCGACG ACACCTTGCT GATGAATTAC TACATACGAT CAGGCGCTTC 
CACACAAGTT GCGGAATGTT CCATTGTAGA GATCACGCAA GCCGTCAGGT TATCCTGATG
CTTCTCACCA TCTTTGTCAT CACAACCGGT TGTAGCAACC CAACTCCCAC CAACAATGAC
ACGCGCGGTA CACCAGAAGT GACCGACGTT ACAATGCGTT TGCAATGGTT ACCGCAATTC
CAATTTGCCG GTTATCTCGT CGCCGAAGCG CGCGGCTACT ACCGCGACGC CGGTCTCAAC
GTAACTATTC TCCCCGGCGG GCCTGATGCC GTCCCGCTCC CCTTGGTTGC CACCGGCGCC
AACACCTTTG GCAGCACCGG CGCCGATACT ATTTTGATCA GTCGCGCCCA GGGGATCGAG
GTCGTAGCGC TGGCAACGTG GTTCCAAGTT AGTCCGGTTG CATTCATGGT ACATCGCGAC
AGCGGCATCC GCTCGCCGCA AGACTTCGTC GGCCGGCGGG TAGCGATGTT CTACGGAGAT
AATGTTGAAA CCGAATATCG AGCGTTGCTG GCCGCCACCG GCGTCGATCC AACCAGCATC
AACGAAATCC CCGGCGACTA CAGCATCGCT CCCTTTCTCG AACGACGGGC CGATGTGTGG
CCGGTCTATG CGACCGACCA GCCCTACACC GCTCGCGCAG CCGGCGCCGA CATCGAGCTG
ATTGTCGCCC GCGATTACGG CGTCGAACTG ATGGGGGATG TGCTCTTCAC CACTGCCGAA
TTCGCCCGTA AGAACCCCAA CACCGTGCGC GCGTTTGTCC AAGCCACGCT GCGCGGTTGG
CAAGACGCGA TTACCGACCC GGCAGCAGCC GTCAATATCA TTTTGGCCCG TAACCCTGAT
TTTGATCGCG GCCATCTTGA ATTTGAGGCG ACGGAAACGA TTAAACTGCT GCGCTACGGT
ATTGGCGCGC GCTGTGTCGG AGCTAGCGAC CAGCAGGCAT GGGCCAAAGA GGCCGAACTG
CTGCGATCGC TAGGGGTGCT GAAAACGGCG GTTGATCCGG CGACGGTGTT GGTCACTGAT
GCGGTTGACG ACTATTACCG CGCACGCGGC ATTGAATGCC AGCGGTAA
 
Protein sequence
MLGRLRRHLA DELLHTIRRF HTSCGMFHCR DHASRQVILM LLTIFVITTG CSNPTPTNND 
TRGTPEVTDV TMRLQWLPQF QFAGYLVAEA RGYYRDAGLN VTILPGGPDA VPLPLVATGA
NTFGSTGADT ILISRAQGIE VVALATWFQV SPVAFMVHRD SGIRSPQDFV GRRVAMFYGD
NVETEYRALL AATGVDPTSI NEIPGDYSIA PFLERRADVW PVYATDQPYT ARAAGADIEL
IVARDYGVEL MGDVLFTTAE FARKNPNTVR AFVQATLRGW QDAITDPAAA VNIILARNPD
FDRGHLEFEA TETIKLLRYG IGARCVGASD QQAWAKEAEL LRSLGVLKTA VDPATVLVTD
AVDDYYRARG IECQR