Gene Cagg_1087 details


Gene Information

Locus tag: Cagg_1087
Symbol:
ID: 7268539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1344671
End bp: 1346035
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 38%
IMG OID: 643565931
Product: hypothetical protein
Protein accession: YP_002462436
Protein GI: 219848003
COG category:
COG ID:
TIGRFAM ID:
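
The length fields in the table above can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates copied from the record above.
length = gene_length_bp(1344671, 1346035)
print(length, protein_length_aa(length))  # prints "1365 454"
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.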


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000322584
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000189736
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAACGA CAAAGAAATT TTTCGGTATA TTCCCCAAAA TATTATTCTT TCCAGTTCTG 
TTTGTTATTA TGGCGTTGCC TCAATTGCTT TCCAAATCAC CTTCGCCATT TTTCAGATCT
ATTAAGCAGC TTTTGGAATC ACAATCTATT TTCACGATAT TACTCATCGT TTTTGGAATA
CTATTACTGA TAGTACTGAT AGCTGTAGTT GCTGTTCTGA CAGATGATGA GTGGAAAGAA
AACATGAAAA GAAAAGTTGA CACTACCGAA GCAGTTAAAC GCGAGCGAGG GGGAAACGGA
ATCTATACAG ACCCGATACC GTTCATCTGG ATATTTTCTG TCTTAACTTA TGTCTATGTA
GAGACAGGTA AAGAATGTGC TATTGTGGAC GGAGAGCGCA TTATTGAAAC GAAACAATGG
ATAACCAAAT CTTTCTTCCA CGATGCAGTT ATCCGGTATG TTTTACTTGA GCCACCACCT
CTAATTATGA CTATCGAAAA TGTTCGCACG CAAGATGATT TGTACCTCAC TGCCGATATC
TCTGTTACAT ACAGAGTTCG AGATCCACTT GCTATTATCA AGAAAGCTGA CCCTCTGAAA
ATTCTACAAG AGCATGTTAA ATCACAATGT ACTAATTTAA TAGGGCGACT AGACTATTAC
ACTATCGCGG ATAAAAAAGC CAAATATGAA AACGAGATAT GTGCAGCTAT CCAGCGAGAA
AGCATATTAC CACTTTTTGA GATTACCAGT GTTCATCTTG AGATGAAATT AGCGGTTGAC
CCGCGAAACG TTGAAAAGAT TGAAGAACTG AAGAGACAGC AAAAAGAGAA GGATCTTCTG
GTTGAAGATA GACAGAAAGA GCGTGATCAT GCCCGTGAGA TAGATAAAGC GAAAATCGGG
GCAGGTTTAA CCATTATTAA AGAGACGGCT GAAATTGCAA AAGGAAGATC GGAGTCCACC
AAAGAATACA TAGATTCTCT AGGTCCTAGG GCTATATTAT CTATGATAGG AACTCCTTAT
CATAACTTAG AATCGGCTCC AAATCTACCT ATGTTGACCG AAGCAACAGA AAGGTCGAGA
TATGAACGTG AAAAGCCTGA ACTCGATAAT CTGCAAGAAA ACGGTGTAAT CAAAAACTAT
GAATCTCGAT GGAGTAAAGA TGGAAATTTC TGTGGCGTGG TGATTGAGGT GGATGATGGA
CAAATTCATA TTTTGTCCCC AAGCTATCCA GATATTGCCC CGACTATTCA ATTCATTTCT
AGATCAGGTC AAACTTATGA ATGGCCTATT GAAAAATGGA ATGCTAATAT GACTATCGTG
CATGTTATCA CAATTGCACT CAGTAAAATA AAACTTCTTC AATGA
 
Protein sequence
MKTTKKFFGI FPKILFFPVL FVIMALPQLL SKSPSPFFRS IKQLLESQSI FTILLIVFGI 
LLLIVLIAVV AVLTDDEWKE NMKRKVDTTE AVKRERGGNG IYTDPIPFIW IFSVLTYVYV
ETGKECAIVD GERIIETKQW ITKSFFHDAV IRYVLLEPPP LIMTIENVRT QDDLYLTADI
SVTYRVRDPL AIIKKADPLK ILQEHVKSQC TNLIGRLDYY TIADKKAKYE NEICAAIQRE
SILPLFEITS VHLEMKLAVD PRNVEKIEEL KRQQKEKDLL VEDRQKERDH AREIDKAKIG
AGLTIIKETA EIAKGRSEST KEYIDSLGPR AILSMIGTPY HNLESAPNLP MLTEATERSR
YEREKPELDN LQENGVIKNY ESRWSKDGNF CGVVIEVDDG QIHILSPSYP DIAPTIQFIS
RSGQTYEWPI EKWNANMTIV HVITIALSKI KLLQ
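
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, with two simple checks added for the gene sequence. The helper names are hypothetical. The completeness check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the full gene).
toy = normalize_sequence("ATGAAAACGA CA\nTGA")
print(toy, looks_like_complete_cds(toy))
```

To apply this to the record, paste the complete Gene sequence block into `normalize_sequence` and compare the result of `gc_fraction` with the GC content field.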