Gene Cagg_1392 details

Gene Information

Locus tag: Cagg_1392
Symbol:
ID: 7267244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1717026
End bp: 1718273
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 55%
IMG OID: 643566235
Product: glycoside hydrolase family 18
Protein accession: YP_002462735
Protein GI: 219848302
COG category: [R] General function prediction only
COG ID: [COG3858] Predicted glycosyl hydrolase
TIGRFAM ID:
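
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1717026, 1718273)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both results match the Gene Length and Protein Length fields reported in the record.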


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.282919
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGATCG GCACGACAAT CGCCGGAGCA ATCTACATTG GTGCCTTGGT CGTGGCAGGG 
CTACTGCTCT GGCAGACGGT AGAAATGTCA CGTACCCTCA TCGCGCTCTC GGCCAACGCA
TCAACGCCAA CACCGGTACC AACGCCAATA CCGACAGCCG CGCCGTTGTT GGTGTTACCA
ACCCCTATCC CACCAACACC CTTGCCTACC ACCGCACCGC CGCAACCGGA AACCTTTGGC
TATCACCCCA AGAGCGGGCG ATACATTGCC GTTTGGCTAC CACCGAACTT CACCGGTGAT
GCCCGTGAGT CGTTTTTCGC CAACGTTGAT ATTATCGACG ACATCAGTCC ATTCTGGTAC
ACAACCGATG CTAGCGGTCG GCTGTACGGG CAGCGCGACG ACGATCTGGT GCGCATTGCC
CATGAAAACA ACATACGAAT CATTCCCTCG ATCCACAATG TCGGCAATCC CGGTGCGGTT
GTACCGGTGT TAACCAATCC ACAGCTCCGT GCGCGCCATA TTCAGAATAT CGTTGATGAA
GTACTGGCTC GCGGCTACGA CGGCATCGAC ATCGACTACG AATCGCTAGA TCCCTCGCTG
CGCGACGATT TTACCGCGTT TATCATTGAC CTGGCTGCTG CGCTACACGC ACACAACAAA
CTCTTGACCG TCGCCGTTCA TGCTAAAGAC CGTGATGATG GCGGCTTAGG GGCATTCCAA
GACTGGCGAG CGATCGGACC GCATGTTGAT CAATTGCGGA TCATGACCTA CGATTATCAT
TGGCGCGGCT CAGGACCAGG ACCGGTTGCA CCGGCCTACT GGATTGAAGC GGTAGCCAAT
TACGCTCGTG AAGTTGTTGA TCCGGCCAAA GTGTTGATCG GTGTTCATTT CTATGGCTAC
GACTGGCCAC CCAACGGCAA CGCAACGGCA CGTCCATGGC GTGTGATCGA GGAGATTATC
AACGAGTATC AACCGACGGT AAGCTTCATT GAACGGAATG CACGTGGTCG GGTCGGTGAG
AGCACCTTTA CCTATCGCAC GAGCGCCGGT ACGCGCACCG TCTGGTTTAT GACCGATACC
GGTCTCGCCG ACAAAATTAC CACCGTGCAG AAGCTTGATC TGGCCGGCAT TGCCATTTGG
CAATTGGGGT ACGAACGTCC TGAATATTGG CAAACAGTAC GAACCAATCT CGTGCAGGAT
TCAACGTTGA TACAACGCGC ATTAAACACC TTGTTACCAG ACCCCTAG
 
Protein sequence
MRIGTTIAGA IYIGALVVAG LLLWQTVEMS RTLIALSANA STPTPVPTPI PTAAPLLVLP 
TPIPPTPLPT TAPPQPETFG YHPKSGRYIA VWLPPNFTGD ARESFFANVD IIDDISPFWY
TTDASGRLYG QRDDDLVRIA HENNIRIIPS IHNVGNPGAV VPVLTNPQLR ARHIQNIVDE
VLARGYDGID IDYESLDPSL RDDFTAFIID LAAALHAHNK LLTVAVHAKD RDDGGLGAFQ
DWRAIGPHVD QLRIMTYDYH WRGSGPGPVA PAYWIEAVAN YAREVVDPAK VLIGVHFYGY
DWPPNGNATA RPWRVIEEII NEYQPTVSFI ERNARGRVGE STFTYRTSAG TRTVWFMTDT
GLADKITTVQ KLDLAGIAIW QLGYERPEYW QTVRTNLVQD STLIQRALNT LLPDP
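
The sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the gene sequence block has been copied into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names. It shows how a value such as the GC content field could be recomputed from the nucleotide sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two groups of the gene sequence above, used as a small example.
fragment = clean_sequence("GTGCGGATCG GCACGACAAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.6
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the reported gene length of 1248 bp, and `gc_fraction` can be compared against the reported GC content.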