Gene Cagg_3830 details


Gene Information

Locus tag: Cagg_3830
Symbol:
ID: 7266310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4667588
End bp: 4669192
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 60%
IMG OID: 643568641
Product: hypothetical protein
Protein accession: YP_002465101
Protein GI: 219850668
COG category: [R] General function prediction only
COG ID: [COG1106] Predicted ATPases
TIGRFAM ID:
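
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4667588
END_BP = 4669192
GENE_LENGTH_BP = 1605
PROTEIN_LENGTH_AA = 534


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4669192 - 4667588 + 1 gives 1605 bp, and 1605 / 3 - 1 gives 534 aa, in agreement with the table.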


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.531313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000259973
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCGAGC GTATCGTTAT TCATCGCTTC CGTGGCATTC GCCAGGGCGA TCTGAACCAT 
CTGCGGAAAT TCAATCTGTT TATCGGACCA AACAACAGCG GCAAGACCGC CATCCTCGAA
CTGCTCTACC TCAGCGCGAC GAGTGGGCGA CCGGTTCAGT TCATCCGTGA CGATCTGCTG
CCTGCCGAGA CCGGTGTGCT CAGGGCGACC ACCTCGGCGC GCACCGATCT GCTGGGCTAC
GAGCCGCTGC CATACCTGCG CCAACGTCAT GGCAAGCACG GCGAGTGGGC CGGCAATCCG
GCGGTGGTGA CACCAGAAGG CGGGTTGGAG ATCAACTTGC GCCGTCTGCC GAATAGCGAT
GGCGCACCTC CGTGGAACTC CTTTCGGCTG GCCGCACCGC TGCCGGACTG GGGCGAGCCG
GATGTGTACG CTTTTCGCAA GGAAGATATT GCCCGCATTG CGATGTTTAC CCTGCCACAG
CCAACGACGC TCGATCCCAG CATGATTCCA CCCGCGATTG CCGCAGCCGG GATCATCCCG
ACCGGCGCAG CCACCGACAC GACCACCGCC GCACCGACAC CAACAACCGA TACAGCGACC
GAAGCAGAGG AGTTGGGCAG CGCAGCCACC GACACGACCA CCGCCGCACC GACACCAACA
ACCGATACAG CGACCGAAGC AGAGGAGTTG GGCAGCGCAG CCACCGACAC GACCACCGCC
GCACCGACAC CAACAACCGA TACAGCGACC GAAGCAGAGG AGTTGGGCAG CGCAGCCACC
GACACGACCA CCGCCGCACC GACACCAACA GACACGACGC CGATCTACGA TTGGCACTAC
CTCTGGGAAC CGGACTGGGT GTACCGTTGG GATCGGCAGC AACCCATTGA TCGCCTGGCG
GTCTGGGTCA CGCAAGGACG GCGACCGCAG CCGCAGCAGG TCGTGTTCTT TTCTTCGCAG
ACGGCGAATA GCCATTTCAC CGACCACTTT GCCAAGTGGG CCTATCACCA TGTCAAGGAC
TGGCACGAAA CGCTTGCCGG GTTGATGGCG CAGGTGTTTC CGGCACTGGA GGGGGCCAAG
ATTGAGGTGC TTGACGCGCC TGACGACCAA CCGGGCCGAA CCGGCTATGT GCGCTTTCCG
AACCGAACGC CGCTGGCCAT CGATCAGTTC GGTGACGGCG CCCGTCATGC GTTCAAGTTG
CTGGCTGCCC TCACCGCCTT AGCCGCGACG GTTGATGACG ATCATCCCGG CTTGCTCTTG
TGGGAGGAGC CAGAGGTGTA TATGCACGCG GCAACCCTCA ACCGTCTGTT ACGCATCGTA
GCCGATATTG TTGCTCAAAA ACCAATTCAG GTATGCATTA CCACTCAGAG TCTGGAAGTT
CTGGCGTGGC TGATTCTCTA TCTTGATCAA CAATCGGCTA TGCAACCGGA TCAGATCAGC
ACGTTTCATC TCAACCTGAA GGATGGACGG TTGCATGTGC GTCCATTTAT TGGCAAAGCG
CTCGGCGGAT GGTTCGATTT CTTTGGTGAT CCGCGCCTGA TTGAAGAAGA CGAACTGGCT
TCACCACTGA CACGCCTGTT GAGCATTCGG GAGGAACGTG AATGA
 
Protein sequence
MIERIVIHRF RGIRQGDLNH LRKFNLFIGP NNSGKTAILE LLYLSATSGR PVQFIRDDLL 
PAETGVLRAT TSARTDLLGY EPLPYLRQRH GKHGEWAGNP AVVTPEGGLE INLRRLPNSD
GAPPWNSFRL AAPLPDWGEP DVYAFRKEDI ARIAMFTLPQ PTTLDPSMIP PAIAAAGIIP
TGAATDTTTA APTPTTDTAT EAEELGSAAT DTTTAAPTPT TDTATEAEEL GSAATDTTTA
APTPTTDTAT EAEELGSAAT DTTTAAPTPT DTTPIYDWHY LWEPDWVYRW DRQQPIDRLA
VWVTQGRRPQ PQQVVFFSSQ TANSHFTDHF AKWAYHHVKD WHETLAGLMA QVFPALEGAK
IEVLDAPDDQ PGRTGYVRFP NRTPLAIDQF GDGARHAFKL LAALTALAAT VDDDHPGLLL
WEEPEVYMHA ATLNRLLRIV ADIVAQKPIQ VCITTQSLEV LAWLILYLDQ QSAMQPDQIS
TFHLNLKDGR LHVRPFIGKA LGGWFDFFGD PRLIEEDELA SPLTRLLSIR EERE
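
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how they relate, the code below strips that whitespace, computes GC content, and translates DNA with the standard codon assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that simplification does not matter here. All function names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence listing.
    first_row = ("ATGATCGAGC GTATCGTTAT TCATCGCTTC "
                 "CGTGGCATTC GCCAGGGCGA TCTGAACCAT")
    print(translate(first_row))  # MIERIVIHRFRGIRQGDLNH
```

The first 60 bases translate to the first 20 residues of the protein listing. Passing the full gene sequence to `gc_content` gives a value that can be compared with the rounded GC content in the Gene Information table.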