Gene Cagg_3170 details


Gene Information

Locus tag: Cagg_3170
Symbol:
ID: 7269919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3847963
End bp: 3849282
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 56%
IMG OID: 643567991
Product: hypothetical protein
Protein accession: YP_002464464
Protein GI: 219850031
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID:
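
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon; the record does not state either convention explicitly, and the variable names are illustrative.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 3847963
end_bp = 3849282

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1320 439
```

Both results agree with the Gene Length and Protein Length fields of the record.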


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTTG CGATCATCGG TGCGGGCTTT GCCGGTCTGA GCGCTGCCTA TGACCTTGCC 
GGACACGGCT ATGCGGTCAC GATCTACGAA GCCGGCGAGC AGGTAGGAGG CTTAGCCAGC
GGTTTTCGCG ATCCGATGTG GGAATGGCCG CTTGAAAGGT TTTATCACCA TATCTTCACC
ACCGATCACG CCATTATTGC CCTAACTAAC GAGATCGGCC TGCGTGACCG GCTCTTCTTT
CGCAGTCCGG TGACGGCTCA GTGGTGGCGA GGACGCGGTT ATGCCCTCGA TGGCGTGCTG
CCGGTGTTAC GGTTTCCCGG ATTACCCTTC ATCGACCGGT TACGGTTTGG GTTGACCGCG
TTTTACCTCA AGTATCTTAC AAACAATTGG CAGAAACTCG AACAAACCAC CGCTACAGTC
TGGATCGAGC GCTTTTCAGG CAAGAACGCA GCCCAAGTGA TCTGGCATCC GTTACTCGAA
GGGAAGTTCG GCCCTCACGC CGATGAGGTC AATATGGCGT GGCTCTGGGC ACGACTTCGC
GCACGGAGCT TTCAGTTGGG CTACTTTGTC GGCGGTTTTC AAGCATTCGC CGACGCATTA
TGTGCCGTAT GCCTCCGTCG CGGCGTACAG GTTGTACTAC GGACACCGGT GAAAGCCGTT
CACCAAACTG CCGGCGGTTG GCAGGTACTG GCCGGTGATC ATCCGCCAAC CGACTATGAT
GCTCTGCTCG TTACCGGATC GCCAGGACTG TTGGCACGAC TGGTTCCTCA TCTCCCAAGC
GGCTACCTCG GCCAATTACG CCGGTTGCGT TCGATGGGAG CAGTTGTGAT GACCATCGCT
CTGCGTCAAC CACTTACCAA CGGTATCTAC TGGCTCAATC TACCGAAGGA TGAATTCCCG
TTTCTGGCGC TGGTTGAACA TACGAACTTT ATTGAACCGA GTCACTATGG CGGTGATAAT
CTGATTTACT GCGGTGATTA TCTCGATCCC GACCACGAGT ATTTTCGGCT CACCCCAGAA
GCGTTGCTCG AACGCTTTGT GCCGGCACTA CGCCAGATCA ACCCCGCCTT CCAACGCGAA
TGGGTACGAG CCTATTGGCT ACACCGCGAA CCCTACGCTC AACCGATTGT GCCGGTTAAC
CATAGCCGCA ACATTCCGCC CATCCCAACT CCCCTACCCG GCCTCTATTG GGCTAGCATG
AGCCAGGTAT ATCCGTGGGA TCGCGGCACC AACTACGCAG TCGAATTGGG CCGACGGGCG
GCACGGATCA TGCGCGAACA CCTGTACCCT AAGCCACAGG CGGTACCATC AGGCTATTGA
 
Protein sequence
MKVAIIGAGF AGLSAAYDLA GHGYAVTIYE AGEQVGGLAS GFRDPMWEWP LERFYHHIFT 
TDHAIIALTN EIGLRDRLFF RSPVTAQWWR GRGYALDGVL PVLRFPGLPF IDRLRFGLTA
FYLKYLTNNW QKLEQTTATV WIERFSGKNA AQVIWHPLLE GKFGPHADEV NMAWLWARLR
ARSFQLGYFV GGFQAFADAL CAVCLRRGVQ VVLRTPVKAV HQTAGGWQVL AGDHPPTDYD
ALLVTGSPGL LARLVPHLPS GYLGQLRRLR SMGAVVMTIA LRQPLTNGIY WLNLPKDEFP
FLALVEHTNF IEPSHYGGDN LIYCGDYLDP DHEYFRLTPE ALLERFVPAL RQINPAFQRE
WVRAYWLHRE PYAQPIVPVN HSRNIPPIPT PLPGLYWASM SQVYPWDRGT NYAVELGRRA
ARIMREHLYP KPQAVPSGY
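
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the Python below strips the block formatting, computes the GC percentage, and translates codons up to the first stop. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter for a gene that starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAGGTTG CGATCATCGG TGCGGGCTTT"
print(translate(first_blocks))  # MKVAIIGAGF
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence entries.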