Gene Cagg_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2974 
Symbol 
ID7266505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3645199 
End bp3646320 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content58% 
IMG OID643567796 
Producthypothetical protein 
Protein accessionYP_002464270 
Protein GI219849837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000254012 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGTGC AACCTTTTAA TCCTGTTGGA ATATTGGCCG GGCTAGCCGA ACAGCTTGGC 
CCGCGCCGTG CCACCTCTAC AGCAGAGGCA GTGGCAGCAG CTCAGATCAA TGCCCGCCTG
CGACAGGCGG GCTTTAGCGT GGACACTCGT TCGCTCCCGG CAGTTGATCA TCCGGGGGTG
CGTTTTAGAC CGGTAGCTGT ATGGTTGATC GCCGTTACGG TGGTAAGTGG TTGGCTTCCG
GTGGCCGGTA GTGTGCTCGT CCTCTGGCTG GTGGCCGTCT TGTGTGCCGA TGCTTTGGCG
ACGCCACTAC CGGTGTGGCG TCGGCAGCAT ATGAGTCAGA ATATTATTGC AGCCCTACCC
ATCGCTGTTA CCGAAATCGG TACGCCGAGT CAACCACGCT GGCGTTTAGT TATCGTCGCA
CCGCTTGACA CCCCCCCAGT ATGGCGCGGC ATATCCCACT TGATCGCGCC AACGACAGAA
GGTTTGTTGA TACGGCTGGT CTTCACCAGC TTACCGCTCG TTAGCACGGC TACTGCGGCC
TGGCCGATCT GGCGGTGGTC GTTGCTGATT GTTGCGCTGC TCGGCGCCCT AGGTTGGCTT
TGGGCTACCT ATCGACCAAT AGAGCTTATG GAGCCTGACG GTGGGATCGC GGCGCTGGCG
GCGCTGTTGA TTGCCGGTCA TCATCTCCAC GGGTTACGTC ACGTGGAGGT GTGGGCGGTG
ACTATCGGTG CTGCCTATTG TGACCAACAC GGTATCAAGA CGTTACTTAC CCGTTACCCG
TTCGATCCGC GGAATACGTT CGTGATCGGC TTAGGACCGT TAGCATGCGG TCAATTGGCA
ATTATCAGCC GTGACGGTGT ATTGCGACAC GAACGGGCCG ATCGGTTTTT GTATCAACTG
GCGATGATGG CGGATCAGAC CGATCCGGCG ATCGATCTCG AACCACGCAC GTTGGCAGTC
CGTGACGAGC TGCTCGCACC GTTTCGGCAA CGACATTTTC GCACGTTGAG TATCCGCGCT
ATCGCCGATA GCAGATGTGA GTACGATCCG TCGTTAGCCG AACGGGCGGC ACGATTAATC
GGTACGATTG CCCGTACCCT TGATAATGAG CCAGAGCGAT GA
 
Protein sequence
MPVQPFNPVG ILAGLAEQLG PRRATSTAEA VAAAQINARL RQAGFSVDTR SLPAVDHPGV 
RFRPVAVWLI AVTVVSGWLP VAGSVLVLWL VAVLCADALA TPLPVWRRQH MSQNIIAALP
IAVTEIGTPS QPRWRLVIVA PLDTPPVWRG ISHLIAPTTE GLLIRLVFTS LPLVSTATAA
WPIWRWSLLI VALLGALGWL WATYRPIELM EPDGGIAALA ALLIAGHHLH GLRHVEVWAV
TIGAAYCDQH GIKTLLTRYP FDPRNTFVIG LGPLACGQLA IISRDGVLRH ERADRFLYQL
AMMADQTDPA IDLEPRTLAV RDELLAPFRQ RHFRTLSIRA IADSRCEYDP SLAERAARLI
GTIARTLDNE PER