Gene Cagg_3310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3310 
Symbol 
ID7267786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4008533 
End bp4010050 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content55% 
IMG OID643568123 
ProductABC transporter related 
Protein accessionYP_002464594 
Protein GI219850161 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.156818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00192862 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACGA CTCCCGTGCT GGAGATGCGC GATATTTCGC GACGGTTTGG TAACACGCAG 
GCATTGGCCG GTGTTAGTTT ACGGTTGTAT CCAGGCGAGG TGCATGCACT GTTGGGAGAA
AATGGCGCCG GCAAATCGAC CTTAATCAAA ATTATGACCG GCGTTTATCA ACCAGACAGT
GGTCAGATTC TGCTTGACGG TCGTCCGGTA CGAATCGGGA GTACGCTCGA AGCGCAGCGA
TTGGGGATCG CGGCGATCTA TCAAGAGCCG TTGATGTACC CAGATCTAAA TGTAGCCGAG
AATATCTTTA TTGCGCACGC CGGGCGTGGG CCGCTCGTGG ATTGGGGCAA GCTGTATCGT
GAGGCAGAAG CAATTCTCGC CCAACTCGAT GTACATCTCG ATGTACGTCA GCCGGCGCGT
GGGCTGAGTG TTGCTGCCCA ACAAACGGTT GAGATCGCGA AAGCGCTATC GTTACAGGTG
CGTGTCTTGA TCATGGATGA GCCGACCGCG GCTCTTTCGG CGCATGAGGT TGAGCAATTG
TTTACGATTG TACGCCGGTT GCGCGATCAA GGTGTTGCCA TTCTCTTCAT TTCCCACCGC
TTAGAAGAAG TTTTCACCAT CGCCGACCGA ATTACGATCT TTCGTGATGG ACGACTGATT
TCGTCCGCAC CGCGTACTGA CGTGACCGTA GCGCAGGCTA TCCGCGATAT GGCCGGGAGG
AGTGTTGAGC AGCTCTTTCC CCGTCGTCAT ACCGTGCGCG ATGAAGTGTT GGTACAAGTG
CGCGATTTGG GTCGTCAAGG CGTCTTTCAA GGTATCTCGT TTGATGTGCG GGCCGGTGAA
GTACTCGGAT TTGCCGGTTT GGTCGGTGCG CGTCGGACCG ATGTTGGTTT GGCGCTATTC
GGTATTGCTC CGGCCGATCA GGGTACGGTA ACGATTGCCG GCCAGCCGGT ACGTATCACG
AATCCGCGGC AAGCGATGCG TTACGGTATT GCCTATGTAA GCGAGGATCG GCGCGGTTTG
GGCTTATCGC TCCCTATGTC GATTGCGGCT AATATCACCC TACCCACGCT CGGTCGGTAT
CTAAACCCGT TGGGCTTGCT GCGCCGTCAG GCTGAATTGG CTACTGCCGA AGAGTTTCGT
CGGCGCCTGG CGATCCGAGC ACCGTCGGTT GAGGTAGAGG TGGGTAAGCT GTCGGGGGGT
AATCAGCAGA AGGTCATGCT GAGCAAGTGG CTCAATACCC GTCCACGTTT GCTTATTCTC
GACGAGCCGA CACGCGGGAT TGATGTTGGT GCGAAGGCTG AAGTTCACCA AATGATCGAT
GATTTGGCCG CTGAAGGAAT TGCGATCATT CTCATCTCTT CCGATCTGCC GGAAGTGTTG
GCAATGAGTG ATCGAGTGTT GGTGATGCGA GAAGGTCGCC AGATGGGTAT CTTTTCGCGC
CACGAGGCAA CGCAAGAACG GGTGTTGGCG GTAGCAATGG GTCAAGAAGC GCACAATGCG
ACAGGAGCGA TGTCATGA
 
Protein sequence
MQTTPVLEMR DISRRFGNTQ ALAGVSLRLY PGEVHALLGE NGAGKSTLIK IMTGVYQPDS 
GQILLDGRPV RIGSTLEAQR LGIAAIYQEP LMYPDLNVAE NIFIAHAGRG PLVDWGKLYR
EAEAILAQLD VHLDVRQPAR GLSVAAQQTV EIAKALSLQV RVLIMDEPTA ALSAHEVEQL
FTIVRRLRDQ GVAILFISHR LEEVFTIADR ITIFRDGRLI SSAPRTDVTV AQAIRDMAGR
SVEQLFPRRH TVRDEVLVQV RDLGRQGVFQ GISFDVRAGE VLGFAGLVGA RRTDVGLALF
GIAPADQGTV TIAGQPVRIT NPRQAMRYGI AYVSEDRRGL GLSLPMSIAA NITLPTLGRY
LNPLGLLRRQ AELATAEEFR RRLAIRAPSV EVEVGKLSGG NQQKVMLSKW LNTRPRLLIL
DEPTRGIDVG AKAEVHQMID DLAAEGIAII LISSDLPEVL AMSDRVLVMR EGRQMGIFSR
HEATQERVLA VAMGQEAHNA TGAMS