Gene Cagg_1323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1323 
Symbol 
ID7268614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1635929 
End bp1637269 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content60% 
IMG OID643566165 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_002462666 
Protein GI219848233 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000672343 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGA TTGTCGTGAT CGGCGGCGGA CCGGCTGGAG TTGAGGCTGC CGTTGCCGCT 
GCAAAGGGTC ATTCGCAGGT GACCTTGATC AGTGAAGGAC CGATCGGTGG ACGTACCGGT
TGGGACAGCC TGTTACCAAG TAAAGTATGG TTGCACGCGG CTGAATTGGC CGGTGTTGCC
GAGCATACTG CCGAAGGAGT AGCAATTGGC GCCGTCCACG TGACACCAAC AGAGGTGCTC
AACCGGATCA AGCAGGTGGC TCAACGGTGG AGTGAGCGTG AGGCACAGCG GCTCCAAGCC
GCCGGGGTCA AGGTAGTCCA CGGGGTAGCT GCATTTCATA GCCCTCACGA GCTAATCGTG
CGCAACGACG ACTCGCAACA GACATTGACC GCCGATGCAG TGATTATCGC TACCGGTTCG
GTACCACGTT TCCCACCGAC AATGAAGCCC GATGGCCAGC GGATTATTGC CCCGCGGTTT
GCCAGTCATC TGAATACATT GCCGCCGGAC ATGATCGTGA TCGGTGGGGG GCCAACAGGT
AGCGAGTTTG CTTCACTCTT TAGCCGGTTG GGGGTCAAAG TGACGTGGAT CGTTGGCGCT
CCCGGCGTAT TGCCGATGTT CGATCCGGCA GCCGGAGCGG CTCTAGCCGC CGCGATGACG
GCGCATGGCG CCACGATCCA TCAGGTTGAT GTGGAACGGG CCGAGCGGAC TGAAGGCGGT
GTAGTTGTCA CTACCGCCGA TGGTGCGACC CATACCGCTG CTATGGCTTT CCTTGCGATT
GGGCGCGTCC CCGATCTGAG TCGGCTCAAT CTTGCTGCGG CCGGTTTGGA AGTTGGTGCT
AATGGACAAC TCGCGGTTGA TAACTATGGC CGTACAGTTG CCGGTCACAT TTTTGTCGTG
GGTGATGCTG CCGGTGGACC GATGCTGGCG AACCGCGCGC TGGCCCAGGC GTGGATTGCC
GGCCGCACAG CAGCGGAACT ATCGGCACCT GGCTATTGCC CGCATACCGT GGTCAGCGCC
GTCTATACCG TGCCCGAAAT TGCGCAAGTG GGGATTGTTT CTGGCGGTGA GGGTGAGTTG
CAGCGAGTGC GGGCTTCGTA TACGGCTTCG CTCAAGGCCT ACCTCAGTGA CGAGACCGAA
GGGTGGGTTG AGTTGATCTA TGATGCGGTG AACCGCCAGA TCCGTGGCGG GATTGCAGTT
GGCACACACG CAGCGGATAT GTTGGCGCCA GTTGCGTTAG CGATCCAAAC CGGCGCGACG
ATCGCTGATC TGGCCGCAGT ATTCGCTGCC TATCCCACGT TAAGTGAGGT GGTGTTTGCG
GCGGCCCGCG CAGTGAAGTA G
 
Protein sequence
MKQIVVIGGG PAGVEAAVAA AKGHSQVTLI SEGPIGGRTG WDSLLPSKVW LHAAELAGVA 
EHTAEGVAIG AVHVTPTEVL NRIKQVAQRW SEREAQRLQA AGVKVVHGVA AFHSPHELIV
RNDDSQQTLT ADAVIIATGS VPRFPPTMKP DGQRIIAPRF ASHLNTLPPD MIVIGGGPTG
SEFASLFSRL GVKVTWIVGA PGVLPMFDPA AGAALAAAMT AHGATIHQVD VERAERTEGG
VVVTTADGAT HTAAMAFLAI GRVPDLSRLN LAAAGLEVGA NGQLAVDNYG RTVAGHIFVV
GDAAGGPMLA NRALAQAWIA GRTAAELSAP GYCPHTVVSA VYTVPEIAQV GIVSGGEGEL
QRVRASYTAS LKAYLSDETE GWVELIYDAV NRQIRGGIAV GTHAADMLAP VALAIQTGAT
IADLAAVFAA YPTLSEVVFA AARAVK