Gene Cagg_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1941 
Symbol 
ID7268857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2375494 
End bp2376441 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content55% 
IMG OID643566779 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002463272 
Protein GI219848839 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.876036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000270678 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAACAC CTATTACCTT AGCCCTCGAT TGGACACCCA ACACAAATCA TATTGGGTTC 
TACGTAGCTA TTGCCAAGGG ATGGTACCGT GACGCCGGAA TCGAACCGAT CATGCTGTCC
CCGGAAGAGG ACAATTATCA GACGACACCG GCTGCGAAGG TGGTAGCAGG AAGAGCGTTA
TTGGCAATTG CTCCCTCGGA GAGTGCTTTG AGTTACCATC TTCACCCCAC CAAACCATCG
TTGGTTGCCA TTGCAGCGCT AGCACAACGC GACACAAGTG CGATTGTGAC ATTAGCCAAC
AGCGGCATTG ATCGACCGGC CAAACTTGAT GGTCGTCGCT ACGCTTCGTA CAACGCACGA
TTTGAGCGCG CAATCGTAGC GCAGATGATC CGCAACGACG GAGGGAAGGG CGAATTCGAT
GAGATCTTTC CACCCAAACT CGGCATCTGG GAAACATTGC TCACCAGCGT TGCCGATGCA
ACATGGGTCT TTATGCCGTG GGAAGGGGTG CAAGCCCGTC GAGCCGGGAT CGCTCTCAAC
GCCTTCCACC TCGACGACTA CGGCATTCCT TACGGCTACA CGCCGATATT GTTGGCCCAC
CCGGATGCAC TCCGCACGCA TCCAGATGCC CTGCGCGCAT TATTGAATGC CACTGCCGAG
GGCTACCGCT TCGCCGTTCA TCATCCCGAT GAAGCCGTGG CAGCACTTAT CACGGAGGCT
AAGCACCCGA GCTTGCAGGA TCGCGATTTT GTGACCGAAA GTTTGTATGA ACTCGCCCCC
GCTCTGCTGA CCGCTGATGG TCGGTGGGGG GTGATGGACG GCCAGCGGTG GCAAGCCTTT
GTGACGTGGC TCGACCAACA AGGTCTGATT GTGGATCGGA ATGGACAGCG CATCCCGTTA
GCACCAGATA CATACCTTGC TCTGTTTACA AATGAGCTTT GGAACTAG
 
Protein sequence
MSTPITLALD WTPNTNHIGF YVAIAKGWYR DAGIEPIMLS PEEDNYQTTP AAKVVAGRAL 
LAIAPSESAL SYHLHPTKPS LVAIAALAQR DTSAIVTLAN SGIDRPAKLD GRRYASYNAR
FERAIVAQMI RNDGGKGEFD EIFPPKLGIW ETLLTSVADA TWVFMPWEGV QARRAGIALN
AFHLDDYGIP YGYTPILLAH PDALRTHPDA LRALLNATAE GYRFAVHHPD EAVAALITEA
KHPSLQDRDF VTESLYELAP ALLTADGRWG VMDGQRWQAF VTWLDQQGLI VDRNGQRIPL
APDTYLALFT NELWN