Gene Cagg_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0033 
Symbol 
ID7269030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp52563 
End bp54338 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content56% 
IMG OID643564906 
Producthypothetical protein 
Protein accessionYP_002461422 
Protein GI219846989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATT TGTCATCTTC GGTGCGCTGG CTGATCGTGA TCGGTATCGT ACTTGTCTGC 
ACCGCACCCC CCGTTCGCGC TGCCGGTGTG GTCGGAAACG GCACGCCGGC AAGCTGCACC
GAAACCGCTC TCCGCGCTGC TGTTGCCGGT GGTGGGCGCG TAACGTTCAA TTGTGGTCCG
CAACCCGTGA CGATTACCCT TTCCGGTCAA TTGGAATTGC GCCAAGATAC CGAACTTGAC
GGTGGTGGTC CCCAGCAAGG TGGGCGTGTC ACCCTGAGTG GTAATGGTCG TACACGCCTG
ATCTGGGTGT ACGATGCTAC GCTCACGATC CGCAACTTGA CCTTGATCAA CGGTCGTAGT
GTGGAAGGCG GGGCTATCCG TGCGGCCGGT CTGAATACGC GAGTGTTTAT CTACAACAGC
ATCTTCCGCA ATAACGATAG TACCGCCGGC AAGGATGAAG AAGGCGGTGG CGCGATTTCA
ATGCATTTTG GGCAGCTCCA TATCGAAGAT AGTGTATTCG AGAACAACCG CGGTATTAAC
GGCGGTGCTA TTTACAATCT GCGTTGCCCG ATCACCGTCC TCCGCTCGAT CTTCCGCAAT
AACGATAGCT CTTATGGTGG CGTTGTCGCC AATTTCGGGT TTGGCGGTGC TATCTATAAC
GATGGCGCCG GCCCGGCAGG GACGGGTGGT CAGATCGTAA TCCGCGATAG CATGTTTATC
GGGAATAAAG CACGTAATTT CGGTGGAGCC GTCTACTCGT ATCTCTACTA CCCCGACCGT
TCTGAAATCG AGCGTAGTTT CTTTGCGGAT AATATTGTCT ATCTCAACAG CAACAATCGC
GCAAGCGGTG GTGCGCTGAT GCATCACAAC GGCCCACTGA CGCTGCGCGA CTCTACCTTC
GTAAGCAACC GCTCGGAAGA TGCTGGCGGT GCCATCGTTG TTGCCCAAAC GACACTTACG
TCGGGGTGGA ATCGTTCGAC TCTCACCAAT CTTACCGTGG TTGGCAATCG GGCTGATGCG
GCTACTGCCG ATCGCGGGAA TGGTGGTGGT CTCTACTTCA ACGGTGGGCA GGCAACTGTC
ACGAATGTTA CAGTTGCTTA CAATTATGCC GACCGGCTCG GTGGTGGTAT ATACAATACC
TCAACCAACG CGGCAGATGT TGAACTACGC AATACGATCA TCGCCGGAAA CTTCCTCGGT
AGTACGCACG ACTCGGTACA GTGTTTTGGC AGCTTGCGCG GCAGTCACAA TCTGCAAAGT
CCGGTGGGGC GTGCCTGTGT TAGCGGGATT GCGCAAGCCG ATCCTCGCCT GGAGAATGCC
ATTGGTTATC ACGGTGGAAT ATTGCCGACG CTTGCCCTCC AGGCCGGTAG TCCGGCGATC
AACGCCGGTG TCGATTGTCC TCTGCTCGAT CAACGGGGGG CAGTGCGGGT TGGCGCATGC
GATCTCGGCG CCTTCGAGTA TGGTGGCGTA GCACCGGCTG CGCAGCTCGA TCCGCCGGTG
TTGGTGAGTG TAACTGCTGT GAATGGTGGC CCGCTGGTGC AACTGACCTT CAATCCGGTC
GCCGGTGCCG GACGCTACGA GCTTGAAGTT CAACGTGAGG GAAGTTCGTC GTGGGTGCAG
TTGTTGACCG ATAACACGGT AGTCCTCGAT GTCGGTCAGT ATATCTTGCG ATTGCGTGCC
TGTAATGATG TTGTCTGTGG TACTTTTGGG AATGAATTGA GTGTGGTTGT GACCCAAACA
CCGCGGAAAT CATTTATCCC CTTCGTCGGA CGATAG
 
Protein sequence
MNNLSSSVRW LIVIGIVLVC TAPPVRAAGV VGNGTPASCT ETALRAAVAG GGRVTFNCGP 
QPVTITLSGQ LELRQDTELD GGGPQQGGRV TLSGNGRTRL IWVYDATLTI RNLTLINGRS
VEGGAIRAAG LNTRVFIYNS IFRNNDSTAG KDEEGGGAIS MHFGQLHIED SVFENNRGIN
GGAIYNLRCP ITVLRSIFRN NDSSYGGVVA NFGFGGAIYN DGAGPAGTGG QIVIRDSMFI
GNKARNFGGA VYSYLYYPDR SEIERSFFAD NIVYLNSNNR ASGGALMHHN GPLTLRDSTF
VSNRSEDAGG AIVVAQTTLT SGWNRSTLTN LTVVGNRADA ATADRGNGGG LYFNGGQATV
TNVTVAYNYA DRLGGGIYNT STNAADVELR NTIIAGNFLG STHDSVQCFG SLRGSHNLQS
PVGRACVSGI AQADPRLENA IGYHGGILPT LALQAGSPAI NAGVDCPLLD QRGAVRVGAC
DLGAFEYGGV APAAQLDPPV LVSVTAVNGG PLVQLTFNPV AGAGRYELEV QREGSSSWVQ
LLTDNTVVLD VGQYILRLRA CNDVVCGTFG NELSVVVTQT PRKSFIPFVG R