Gene Cagg_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2420 
Symbol 
ID7266143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2939092 
End bp2940183 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content60% 
IMG OID643567246 
ProductHEAT domain containing protein 
Protein accessionYP_002463729 
Protein GI219849296 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACC GCGAAACGTG GCGGCAGCGG ATCGCCGAAC GCTTTAACAA CTTCGCCCGC 
AACCCTCGGC AGGAGATCCA AGTTACCGGT GTGAACACTG TGCTTGGCTT TTTAGCCGTG
CGTGCGCTTG AACCGTTCCT CGAAGCATTT CAGGACGAAC CGGTAGCAGC CGTGCTGACC
CTCGCCGAGA TTTCCCGTGG TCCCGGCGCC AACCATCTTG TGCGTCGTGC TTTCCACTGG
CGCTACCAAC TGGCACAACT GATCGAACGT GAGCTGCGTT CACGGCCTGA ATTGCGGATC
ACCGTCGAAG AGATTCTGAT GGCCTTAAAC GTGATCCATC TGGCCCGCCA ACGGTTGAAT
AGCTCACGTG ACGAGTGGCT ACGCCTAACC CTGCTCGCCG AACTTGACAC CTTTGAGCCC
GGTGATTTCG AGCAGCTTCG TCGCCAGTTG CACGACCCCG GCTGGCAAAG TCGCTATGAA
GCCATTCGCC GTCTGCGTGT GCGCGAAGGC AATTTTACCG CTGCCGATCT GGTCTTGCTC
CACGATGGGC TGAGTGATAG TGCCTCACAC GTGCGTGCAG CAGCAGCGCG TACCCTTGGT
CTCATCACCG GTACACCGCC CCAACCCCTC GTCAAGACGT TAATCCGGCT TGCGATCCAC
GATTGCGATC TGGAAACCCG CTTCGCCGCC GCACGCACGC TCGGTCAACT GCGTGACCGT
ATCGCTTCAC CGCAGTTGAT CGATTATCTG GTTGAATGCC TGGAAGATCC CGATAGCTTC
GTTCGCTCGG CAGCAGCCTT GGTGGTGAGC CAGTTGGGCG AGATGGCCGG TACCGGCCCG
GTGATCGACC ACCTCCTCGT TATGCTCAAC GATGTCGATG CCTACGCTCG TGAATCTGCC
GCGCGCGCCC TCGGTCGTCT CGGTGTCGCT GCTGCGACCT CTACCGTCCT GAATGCGCTT
GCCCAGGCCG TTGATGATGC TGACCCTAAT GTCCACGAGG CAGCAGTTGA TGCCATCGCT
CGCCTGCGGA AGCTACGCGC TACCCTACCA CTCACGCAGA GCCGCCATCC CACGGAGCCG
TTAGCAGTCT AG
 
Protein sequence
MFDRETWRQR IAERFNNFAR NPRQEIQVTG VNTVLGFLAV RALEPFLEAF QDEPVAAVLT 
LAEISRGPGA NHLVRRAFHW RYQLAQLIER ELRSRPELRI TVEEILMALN VIHLARQRLN
SSRDEWLRLT LLAELDTFEP GDFEQLRRQL HDPGWQSRYE AIRRLRVREG NFTAADLVLL
HDGLSDSASH VRAAAARTLG LITGTPPQPL VKTLIRLAIH DCDLETRFAA ARTLGQLRDR
IASPQLIDYL VECLEDPDSF VRSAAALVVS QLGEMAGTGP VIDHLLVMLN DVDAYARESA
ARALGRLGVA AATSTVLNAL AQAVDDADPN VHEAAVDAIA RLRKLRATLP LTQSRHPTEP
LAV