Gene Cagg_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1333 
Symbol 
ID7268624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1650768 
End bp1652285 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content55% 
IMG OID643566175 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_002462676 
Protein GI219848243 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00265785 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAACTGT TTGTCGACTC GCTGCGGCGC AACATGACCG CCGAAGACGT GGTGAACCTC 
GAAGCATGCA TGAACTGCAA GATGTGCGGG GAGGCGTGCG CGTGGTATCT CGTGACCGGC
GATGAAAAGC TCCATCCAAC CCACAAGACC GGTTTTCTCC GCCAGATTTA CCAGCGCTAT
CTGACGGTCG AAGGGCGGAT CGGTGGTGCG CTTGGTCTCG TGCCGACACC CACCGTTGCC
GATCTGAAAG AGAATATGCA GTATTTCTGG GCATGTACAG CTTGTGGGCG CTGTACGTTG
GCTTGTCCGT CCGGTATCAG CATTCGCCGC ATGGTGCGTC TAGCCCGTGC CGCCTACACC
GATTCCGGTT TGAGCCAGAC AAATCCGACT ATTCGTTCGA TTATCGAGAA TACCGATCGC
CATCGACACA GTTTTGGTTT AACCGCTGCA CAGGTCCTCG GACGAGTCGG CCTCTTCTTG
CGCAGTGAAG GACTGGAAGT GCCGGTCAAT GTGTCCGGCG CCGAACTGCT CTTTGTTTGT
CCGGCTGCCG GAAATACCAA AATCCCCGAT TACGGCATCA AACTCATTAA AATTCTTAAC
GCCGCCGGTG TCAGTTATAC CATTTCACCT TATGTTATCG ATACCGGTAC TGAAATTGAT
CATATTGCTG TTCATCACAA CCTGTCGAAG CAAATGTTGT TGGACTGGGA GGAGGAAGCC
GATCGGTTAG GTGTGAAAGC GATCCTGCTG GTGGAATGTG GCTGCGATAC GCGCACTCTA
TACGCCGAGG CAACCGAAAC GCTTGGTCGC CCCTTCCGCT ACCCGATTAT CAGTGTTGAT
TCACTGATGC TTGATCTAAT CCGAGAAGGA CGGTTACCGG TTGAAAAGAC CCAGTTGAAG
GTAACCCTGC ACGATCCATG CTACGCAACG CGCCTCTCTG GGTTGGGTGA TCTGTTCCGC
GAGCTGCTGA ATCTGGTTAC CGATAATTTC ATCGAGATGA CGCCAAACCG CGAGCACAAC
TACTGCTGCA ACGGTGGGGC CGGTGGCATG CGGTTGCCGG AAAACACGAA TCTACGGCGC
AAGATCTCGG TGCTGAAGGC AAACCAAATT CGCGCTACCG GTGCAGATTA TGTCACCTCA
CCGTGTGTGG TTTGTACGTT ATCGCTGGAA GACACCTGCC AGACGTACAA TCTCTCGCCC
ACCGGCGAGC GGATGGCGCT GGTGCTGTTC GAGGTCGTGT ATGCCGCAAT GGAGCCGGCG
CTGGCGAAGC GCGGCGAACT CGACCGGATG CGCGTCCCTG CGGAGCTGCG ACACCGCGAT
CATGAGTTCT TTGTCGCACA TAGTATCGAG GGCCAGATTG CGACACTGAT GCAGCAACCC
GATTTTCCGG CTTTGCTCGA GTGGCTGGAG AAAGACGATA TTGTGAAGCG ATTTAGCAAA
GATCATCCGC AGGTCTACGA TCTCCTCCGA TCGTGGCGGG AGTTTGCGAT GTCGCTCGAT
CCGGAGTGCT GTCGGTAG
 
Protein sequence
MQLFVDSLRR NMTAEDVVNL EACMNCKMCG EACAWYLVTG DEKLHPTHKT GFLRQIYQRY 
LTVEGRIGGA LGLVPTPTVA DLKENMQYFW ACTACGRCTL ACPSGISIRR MVRLARAAYT
DSGLSQTNPT IRSIIENTDR HRHSFGLTAA QVLGRVGLFL RSEGLEVPVN VSGAELLFVC
PAAGNTKIPD YGIKLIKILN AAGVSYTISP YVIDTGTEID HIAVHHNLSK QMLLDWEEEA
DRLGVKAILL VECGCDTRTL YAEATETLGR PFRYPIISVD SLMLDLIREG RLPVEKTQLK
VTLHDPCYAT RLSGLGDLFR ELLNLVTDNF IEMTPNREHN YCCNGGAGGM RLPENTNLRR
KISVLKANQI RATGADYVTS PCVVCTLSLE DTCQTYNLSP TGERMALVLF EVVYAAMEPA
LAKRGELDRM RVPAELRHRD HEFFVAHSIE GQIATLMQQP DFPALLEWLE KDDIVKRFSK
DHPQVYDLLR SWREFAMSLD PECCR