Gene Cagg_3329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3329 
Symbol 
ID7267069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4039277 
End bp4040953 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content63% 
IMG OID643568141 
Productpolymorphic outer membrane protein 
Protein accessionYP_002464612 
Protein GI219850179 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000300518 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCGTGATG GCGGGGCAAT CGCCATCAGG GACGCCAGGC GTGTCGATGT GACGGATAGT 
CACTTCGCCA ACAATACCGC CGGATCCGCG ACCTTGAGAG CCAGCGGTGG CGCGATATGG
GCGGTGAAGG CCGACTACAC GCCGGCAGAC CCACCACTGA CCATCACGGC CGGCACGTTC
ACCGCAAATT ATGCATGGGA CCGAGGCGGG GCGATCTTTG CACAATGGTA TGCGACGGTG
ATCGCGCAGA GCCGGTTCAC CGAGAACGGC TCCGACTATG ACGGTGGTGC CATCTCCATG
TCCATGGGAT CGCTCGTCAT TCGGGATTCC GAGCTGACGC AGAATCGATC GGTGTCTTAT
GGAGGAGCAA TCTACGCATA CATCCACGAC CATACCCTCA CCGTGACCAA CACCGTCTTT
CGCGGCAACG AGTGCGACGG TGACGGGGGG GCGATCTGGA AACGACGGGG CCATGCACGG
ATTGAGGCCT CTCGGTTCAT CGAGAATCGC ACCGGCGGCT TCGGCGGGGG ACTAAAGGTT
ACCGTTGGCA CGACCGACGT TATCGGTAGT GAGTTCAGGG GTAACACGGC CCGGCTTGGT
GGCGGGATTC ACAGTGATAC CGAGACCCTG ACCGTGCGGG CATCGTCCTT TGTGGCAAAC
ACGGCCCGGC TTGGTGGCGG CATTGCCAAT GATACCTTTG GCCATGGAGG ATCGGCCCGC
ATCGAAACCA CGACGTTTGT CCGAAACGCC GCAACCGTCG GCGACGAACG CGAGGCGGAT
GCTCTCCCAC CTGTCGGGTC CGGCGACGAA GATCCTCCAC CTGTCGGGTC CGGCGACGAA
GATCCCCTAC CTTTCGGATC CGGCGGGGCA CTCTTCAACA CCGCCACGCT CGAAGTCGTC
GCTGCGACCT TGCACGAGAA TACGGCGGAA CGGCAGGGTG GCGGCATCTT TAGCGCGACC
AAATCCGACA ACCCGCCGAG CCGGCTAACG GTGGTAAACA CCATCATCAC CGGCAGTCCC
CACGGCGGGG ATTGTGTCTC CTTCTCACCG GTGGACGGGC ATCACACCAT CATCGGAACC
CCGACCGCGG CATGTGGTCT GAGCGAAAGC CCGCTCATCG GTATCGACCC CCGGCTGGGC
GAGTTGACCG GCGACCCACC CTACCTGCCG CTCCTTCCCG ACAGTCCGGC CATTGATGCC
GGCGATAGTA CGATGTGTCC CGGTCCGGGG CAAAACGGGG TCCCGCGTCC GGTTGGGTCG
GCATGCGATA TTGGAGCAGT GGAATGGACG GCGGATACCG ACCCGCCAAC CCTGACCGCC
GTGGCATGCA TCACCGATCC GCCCGCCGGC ACGTCGGTGC AGTTTCGGGT CGTGTTCTCC
GAAGCCGTTC GCGATGTCGA TGCGACCGAT CTGGTTGTTG AGGCATCACC CGCATCAACA
CCGGCAACGA TTACCGACCT GACGGCACGG TCTGCCGCCG AGTACCTCGT GACCGTCGGC
GAGTACGGGT CGGATACCGA AACCCTGACC CTCGCCATCG CGTCAACGGC GACCATTACC
GACCTGGCCG GGCATCCGCT CGTGCCGCCG TCGGCGATGA CCGGAGGACG CTGCGCGGTG
GGACGTTCCC CGTCCACGCC GGACCGTCAG ATCTTCCTCC CGTTGATCGT CCGCTAA
 
Protein sequence
MRDGGAIAIR DARRVDVTDS HFANNTAGSA TLRASGGAIW AVKADYTPAD PPLTITAGTF 
TANYAWDRGG AIFAQWYATV IAQSRFTENG SDYDGGAISM SMGSLVIRDS ELTQNRSVSY
GGAIYAYIHD HTLTVTNTVF RGNECDGDGG AIWKRRGHAR IEASRFIENR TGGFGGGLKV
TVGTTDVIGS EFRGNTARLG GGIHSDTETL TVRASSFVAN TARLGGGIAN DTFGHGGSAR
IETTTFVRNA ATVGDEREAD ALPPVGSGDE DPPPVGSGDE DPLPFGSGGA LFNTATLEVV
AATLHENTAE RQGGGIFSAT KSDNPPSRLT VVNTIITGSP HGGDCVSFSP VDGHHTIIGT
PTAACGLSES PLIGIDPRLG ELTGDPPYLP LLPDSPAIDA GDSTMCPGPG QNGVPRPVGS
ACDIGAVEWT ADTDPPTLTA VACITDPPAG TSVQFRVVFS EAVRDVDATD LVVEASPAST
PATITDLTAR SAAEYLVTVG EYGSDTETLT LAIASTATIT DLAGHPLVPP SAMTGGRCAV
GRSPSTPDRQ IFLPLIVR