Gene Cagg_3500 details

Gene Information

Locus tag: Cagg_3500
Symbol:
ID: 7266428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4265848
End bp: 4266888
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 53%
IMG OID: 643568308
Product: hypothetical protein
Protein accession: YP_002464775
Protein GI: 219850342
COG category: [S] Function unknown
COG ID: [COG0392] Predicted integral membrane protein
TIGRFAM ID: [TIGR00374] conserved hypothetical protein
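
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4265848
end_bp = 4266888
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS consists of whole codons, the last one being a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 4266888 - 4265848 + 1 = 1041 bp, and 1041 / 3 = 347 codons, one of which is the stop codon, leaving 346 amino acids.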


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAC CATTTGTCAA CGCACACAAA GCCGTCCGTC AGAACTGGAT TCAGTATGTC 
GTTGCTATCT GCGGCTTAGG GATTGTCTTA GGGTTGGCCT TAAGCTCTAT CGACCTGCAC
GAGCTGCATC AAGTACTGAC GACTGCTAAT CCATGGTGGC TGACAGCCGC CGTCATCTGT
AAAGTCCTTA CACCGTTAGG CACGGCGACG CTGTATGCCG GCGTCCTTCG CATGCTCGGT
CATCACATCC GCGCGATCAG TCTCTGGTTG ATTGCACAAA TGGCGATTGT GATCAACATG
GCATTTCCGG CCGGTCCGAT GGCGATGAGT GCCTTTCTCC TCCACGTCTT TCGCCGCCGA
GGTGTACCGG AGGGCATTAC CACTATCGCC GTCGTCATCG ATTCACTGAC GTATGAGACG
ACGTTCTTTG GCTTAGTTGG TTTTGGACTG GCCTATCTTC TGATGCATCG CGATCTCAGC
GTGAGTCAAA TTACCGAAGT TGGGATCATT GCGCTAATCA TCGTTATCAC CGGAATGTAT
CTCTGGGGAT TACAGCGTGA TCGTGCCGAT TTCACCCGCA AAGCAATTGC TGTTCAACAA
TGGCTGGCCC GCCTTTTGCG CCGGCAGTGG CGACCAAATC AGGTTGAACA GTTTCTTGAC
GAATTGTACC GTGGAAAGGC ACTTGTCGCT CGTCAACCAA AAACATTTTC ACGGTTACTG
GGAATTCAGA TTGCTGTTCT GTGCCTCGAT ATCCTGACGC TCTACTGTGC CTTTCGCACG
GTTGGGAGTG ACCCGCACCT ATCGGTCGTG ATCCTGAGTT ATAGCCTCGC CAGTCTTTTT
GCGACGCTGG CACCCCTGCC CGGCGGCGGT GGCTCGTTTG AAGCAACCCT TGTCTTGGTT
GCATCACGTC TTGGCATTTC CCCCACTGTC TCGTTAAGCG CGACGCTCAT CTACCGGATT
TTGACCTTCT GGCTACCCGG CTTGCTGACC ATTATTATGT ACCGTCTGCT CAAACCGACA
TCATCGCAGA CCCATACGTG A
 
Protein sequence
MAEPFVNAHK AVRQNWIQYV VAICGLGIVL GLALSSIDLH ELHQVLTTAN PWWLTAAVIC 
KVLTPLGTAT LYAGVLRMLG HHIRAISLWL IAQMAIVINM AFPAGPMAMS AFLLHVFRRR
GVPEGITTIA VVIDSLTYET TFFGLVGFGL AYLLMHRDLS VSQITEVGII ALIIVITGMY
LWGLQRDRAD FTRKAIAVQQ WLARLLRRQW RPNQVEQFLD ELYRGKALVA RQPKTFSRLL
GIQIAVLCLD ILTLYCAFRT VGSDPHLSVV ILSYSLASLF ATLAPLPGGG GSFEATLVLV
ASRLGISPTV SLSATLIYRI LTFWLPGLLT IIMYRLLKPT SSQTHT
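
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method used to produce the record. It assumes that internal codons under translation table 11 are read with the same codon-to-amino-acid assignments as the standard genetic code, and it does not handle the alternative start codons of table 11 (not needed here, since this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCAGAAC CATTTGTCAA CGCACACAAA GCCGTCCGTC AGAACTGGAT TCAGTATGTC"
print(translate(first_line))  # MAEPFVNAHKAVRQNWIQYV

# Last line of the gene sequence, which ends with the stop codon TGA.
last_line = "TCATCGCAGA CCCATACGTG A"
print(translate(last_line))  # SSQTHT
```

The two outputs match the first 20 and the last 6 residues of the protein sequence listed above. The last line can be translated on its own because the preceding 1020 bases are a whole number of codons.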