Gene Cagg_2501 details

Gene Information

Locus tag: Cagg_2501
Symbol:
ID: 7269347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3045634
End bp: 3047238
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 58%
IMG OID: 643567327
Product: ABC transporter related
Protein accession: YP_002463808
Protein GI: 219849375
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR01166] cobalt transport protein ATP-binding subunit
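
The coordinates and lengths in this record agree with one another, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either convention. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3045634, 3047238)
print(length_bp)                  # 1605
print(protein_length(length_bp))  # 534
```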


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGATCGAAC TCGACCGTGT CAGCTACACC TACCCCGGTG CCACCACTCC CATCTTCCGC 
CAACTGAGCC TCCATATCGC ACCGGGCGAG TTTCTCCTGA TCTGCGGACC GAGCGGAGCC
GGCAAAAGCA CGTTGCTCCG TCTCCTCAAC GGCCTCATCC CACACTTCTA CGGTGGTACA
TTCGGGGGAC GTGTCCGAGT GTGGGGCCGC GACACGTTAA CCCACCAGCC CCGCGATCTA
GCCGACCTAA TCGGCTTTGT CGTTCAAGAC CCCGAAAGTC AGTTTGTCGT CGAGCGGGTC
GAAGACGAAA TCGTCTTCGC TATGGAAAAC CTGGGTGTCC CCACACCGGT TATGCGCCGA
CGAGTCGAAG AGGTACTCGA TCAACTAGCG ATAGCCCACC TACGACATCG TCACGTAGCA
ACCCTTAGCG GCGGCGAACA GCAGCGCGTC GCCATTGCCG CAGCACTAGC CGCCCAGCCG
CGAGCATTAG TCCTCGACGA ACCAACCAGC CAACTCGATC CACATGGAGC AGAAGAGGTC
CTAACCGCGT TGCAAAAACT GAACGCCGAT CTCGGCATTA CCATCATCTT ATGCGAGCAT
CGGCTGGAAC GAGTCGTACA ATACGCCGAC CGTATCCTCT ATCTTACACC GAATGGCCAA
TACGAAGTAG GCACACCGCG TGAGGTACTA TCCCGCATTG AACTACGTCC ACCGTTACAA
CATCTCGCCA CAGCATTAGG CTGGGAACCA CTCCCATTGA CGATTAAAGA AGGGCGAAGA
TTTGTTCCCA CCAATCTACC GGCGCCACCA CCTACCCCTC CCGTATCGCC ACCTGCTCCA
CCGCTTGCCC GCCTACGCAA CGTCACCGTC CAATATGGTC AGCACGAAGC GCTCTATCGC
GTAAACCTCG ATCTGCCATA CGGTACACTC ATTGCCCTTA TGGGACGGAA CGGCAGTGGC
AAAAGCACCT TACTCAAAAC CCTCATCGGG CTAGTCCGTC CAACCGAAGG CAAAATCGAG
CTAGAAGGGC GTGACATTGC GCCACTTACC GTCGAGGAAC TCGCCCGCAC GGTCGCATAC
GTCCCGCAAG ATCCGCATAG CATCCTCTTC CACGACACCC TCCACGAGGA GCTAGCGTTT
AGCCTACGCG GTCGCGGCTT ACCACCGTCA CCAACGCTGA TCGACACCAC ACTGCGCAAA
TTCGGTATTG CCCATCTCGC CACCCACTAT CCGCGCAATC TCAGCGGTGG TGAAGCGCAA
CGTGCCGCCT TAGCGACGGC GCTAGTCGGA GAACCACGTC TGATCTTGCT CGATGAACCG
ACCCGCGGTT TGGATTATCA AGCCAAAACC ACCCTCATAA CCTTATTGCG CCGGTTGTGT
AGCGAGGGTC GTACCGTTGT GATAGCAACT CACGATGTTG AGATGGCTGC CGCCTGCGCC
GACCGGGTTA TCCTGCTCGG TGAAGGTGAG GTAGTAGTCG AAGGGCCGGC TACTGAGTTA
CTCGGCGACA GCTTGCTCTT CTCATCGCAG GTCGCCAAAC TCTTTCCCGG CTACGGCTGG
CTAACAGTAG CCCAAGCCTT GGCCGGCCTC GGCAAAACAG CCTGA
 
Protein sequence
MIELDRVSYT YPGATTPIFR QLSLHIAPGE FLLICGPSGA GKSTLLRLLN GLIPHFYGGT 
FGGRVRVWGR DTLTHQPRDL ADLIGFVVQD PESQFVVERV EDEIVFAMEN LGVPTPVMRR
RVEEVLDQLA IAHLRHRHVA TLSGGEQQRV AIAAALAAQP RALVLDEPTS QLDPHGAEEV
LTALQKLNAD LGITIILCEH RLERVVQYAD RILYLTPNGQ YEVGTPREVL SRIELRPPLQ
HLATALGWEP LPLTIKEGRR FVPTNLPAPP PTPPVSPPAP PLARLRNVTV QYGQHEALYR
VNLDLPYGTL IALMGRNGSG KSTLLKTLIG LVRPTEGKIE LEGRDIAPLT VEELARTVAY
VPQDPHSILF HDTLHEELAF SLRGRGLPPS PTLIDTTLRK FGIAHLATHY PRNLSGGEAQ
RAALATALVG EPRLILLDEP TRGLDYQAKT TLITLLRRLC SEGRTVVIAT HDVEMAAACA
DRVILLGEGE VVVEGPATEL LGDSLLFSSQ VAKLFPGYGW LTVAQALAGL GKTA
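
The sequences above are displayed in groups of ten characters, so the spaces and line breaks need to be removed before computing anything such as length or GC content. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character groups."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGATCGAAC TCGACCGTGT CAGCTACACC TACCCCGGTG CCACCACTCC CATCTTCCGC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence block through the same two functions gives values that can be compared with the Gene Length and GC content fields in the record.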