Gene Cagg_2572 details

Gene Information

Locus tag: Cagg_2572
Symbol:
ID: 7267161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3134768
End bp: 3136825
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 57%
IMG OID: 643567396
Product: hypothetical protein
Protein accession: YP_002463877
Protein GI: 219849444
COG category: [R] General function prediction only
COG ID: [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation
TIGRFAM ID:
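
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention explicitly). The function names span_length and coding_length_aa are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3134768
END_BP = 3136825
GENE_LENGTH_BP = 2058
PROTEIN_LENGTH_AA = 685


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING the length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the coordinates span 2058 bp, which corresponds to 686 codons, or 685 residues once the stop codon is excluded.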


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCCG GCAAACTCAC TTCCTCGCGC ACAACGATCT GGCTGGCAGC AATTTGGCTG 
GCGGCAATGA TCAGCATAGC CGTACCGGGA TTGCTTTGGT GGCTACCGTG GTCATTCTTC
AGCACGCCAT TGGTCGTAAT CAGCGGCGGG ATGCTGTTTA CCCTACCGGG GTTAGCCCTC
TTACGTTGGC TCCACCCTAG CCCGTTGCAT GGGTTTGAAC GCCTGGCATA CTCGGCAAGT
CTGAGTTGCG CGGTCCTTCC TCTCATTCTC CTGTTTAGTG AACCTATCGG CTGGCGCTGG
AATGGACTGT CTGCCTGGTT GGTTATCGGT GGCTGTGCGG TACTGGCCTT GTGGCCGCAG
ACATCGGCGA TACGCCGAAC CGCTCCCGCT TCAGGCAATT GGCGACAAGC TGCTCTACCA
ATTCAACCAC GCCACCATCT GATCGGATGG GTGTTGATCT TTCTAACCGG TGCAGCCTGT
GCGGTACGCT TGTTTCTTGT ACGTGATGTA CCGCTTGGGT TTTACGGCGA TTCGTATCAT
CATACGGTGA TCACCCAGCT CCTCATCGAC CACGGCGGCC TGTTCCGATC ATGGGAACCT
TATGCTCCTG CCGTCACCTT CACCTATCAC TATGCGTTCC ACGCGATGGG TGCCTGGTGG
CATTGGCTCG CCGGTATCCC GGCAACCCAA GCGGTGATCT GGACGGGACA GGTGATGAAT
GGTTTGGCAG TCCCGCTGAT GTATTTGCTG TCCACTCGTC TTACCAATTC ACGTCTGACC
GGTTTGTGGG CGGCTGTGAT CGTGGGCTTT GTGAGTGTGT ACCCGGCTTA TTACGTCAAT
TGGGGACGCT ACACACAGTT AGCCGGTCAG ACGGTACTAC CGGCGGCGGC GATGGCTTGG
CTGACGGTGA TCGATGGTGC ATTGCACCTA CAACGGTCAT GGAGACAGTT AGCGCACCAG
TTGGTCCTGG CAATAATCAC CGGTGCCGGC CTGGCGGTCA GTCATTACCG GGTAGCGATC
CTTGCGATCT GCTTGCTGGT GGCCTACACG GTAACGGTGC TCGTGACGGC TTGGCCAATC
GAACGTCGAT CATTGATCCG CTTCCTGACG GTAGGAGCCA TCGGTGCTAT CGGCGCACTG
CTGCTAGCAT TACCTTGGCT CTGGCGAGTA CGAGAGGGGC AGATCACGCG ACTGGCTACG
AACCTCGTCC TCAACAATAG TGAAGCCAGT AATCCTTTTT CACCCGAAAC CGTCGGGGCA
GCATTTCAGC ATGGTCTCTT CCCTTTGGCC GCGTTGGGCT TGGGTAGTCT GCTCTGGCGA
CGACAGCTCG GTGGGATTGT GCTCGCGCTC TGGGCCGGTT TCGCATGGGT GGCCGCTAAC
CCGCAATTGA TCGGCTTGAA CGGACAGGGT TTGATTACTT CATTTACCGT CCTGATCGGT
GCGTATATGG CGATTGCACC GGCTGCCGGT GCGGGTATCG TGGCGCTGTT CAGGCTGATC
GCACGCCTGA TGATCACCCT GCCTCGTCAC GCTGCAACTG CACTCGTTGC CGTCCACCTG
GGGAGTGGTC TCCTCATTGT CGGGTGGGGA TCGTATTTTC AGGCGACGAT CCTCGATCCG
GCCTACCAAC TTGCCACCCC TGCCGATCTG AAAGCCGCAG CATGGATTCG CGATCATCTC
CCACCAGATG CAGCGGTTTT CGTTAACGGG TTTGCAGCCT ACGGTGGCTA TGTTTACGCC
GGTAGCGATG GGGGCTGGTG GTTGACCTTG CTGACCGGAC GACGAACTAA TCTGCTGCCG
ATGGCCGTCG GTTTTGAGGC AATCGATCCA CCAAACATGT TGCAGATCAT TCGCGAGCAG
CATCAGGCCG TACAACAGTT TCCCATTGGG AGTGCAGAAG CGGCAGCAGC GCTCCGCTCA
CTTGGTTTTG CGTATCTGTA CAATGGCCCG GCGGCTAATC CGCCCGGCGA ATATCTCGAT
CCGGCGCAAA TTGACGCCAC ACCGTTGTAT GAGCTTATCT ATCGCCAAGA TGGCGTGAGT
ATTTGGAGAA TCCGCTAG
 
Protein sequence
MIAGKLTSSR TTIWLAAIWL AAMISIAVPG LLWWLPWSFF STPLVVISGG MLFTLPGLAL 
LRWLHPSPLH GFERLAYSAS LSCAVLPLIL LFSEPIGWRW NGLSAWLVIG GCAVLALWPQ
TSAIRRTAPA SGNWRQAALP IQPRHHLIGW VLIFLTGAAC AVRLFLVRDV PLGFYGDSYH
HTVITQLLID HGGLFRSWEP YAPAVTFTYH YAFHAMGAWW HWLAGIPATQ AVIWTGQVMN
GLAVPLMYLL STRLTNSRLT GLWAAVIVGF VSVYPAYYVN WGRYTQLAGQ TVLPAAAMAW
LTVIDGALHL QRSWRQLAHQ LVLAIITGAG LAVSHYRVAI LAICLLVAYT VTVLVTAWPI
ERRSLIRFLT VGAIGAIGAL LLALPWLWRV REGQITRLAT NLVLNNSEAS NPFSPETVGA
AFQHGLFPLA ALGLGSLLWR RQLGGIVLAL WAGFAWVAAN PQLIGLNGQG LITSFTVLIG
AYMAIAPAAG AGIVALFRLI ARLMITLPRH AATALVAVHL GSGLLIVGWG SYFQATILDP
AYQLATPADL KAAAWIRDHL PPDAAVFVNG FAAYGGYVYA GSDGGWWLTL LTGRRTNLLP
MAVGFEAIDP PNMLQIIREQ HQAVQQFPIG SAEAAAALRS LGFAYLYNGP AANPPGEYLD
PAQIDATPLY ELIYRQDGVS IWRIR
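
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before their lengths or base composition can be compared with the Gene Length, Protein Length and GC content fields. The following is a minimal sketch of that step. The helper names clean_sequence and gc_percent are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks
    (possibly over several lines) into one upper-case string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from this record.
first_line = (
    "ATGATTGCCG GCAAACTCAC TTCCTCGCGC "
    "ACAACGATCT GGCTGGCAGC AATTTGGCTG"
)
sequence = clean_sequence(first_line)
print(len(sequence))  # 60: six blocks of ten bases
print(round(gc_percent(sequence), 1))
```

Passing the complete gene sequence to clean_sequence instead of the single sample line allows its length to be compared with the listed 2058 bp and its GC percentage with the reported GC content.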