Gene Cagg_2711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2711 
Symbol 
ID7269618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3325221 
End bp3328205 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content59% 
IMG OID643567537 
Producthypothetical protein 
Protein accessionYP_002464015 
Protein GI219849582 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
[COG0739] Membrane proteins related to metalloendopeptidases
[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0583099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATGGT CAACCGCCCT TGCCCGCCAC TACCACATCC ACGGCACGGT AACATCACTC 
CCCGGTGAAT ATGATCTCAA CCTGCTCGTT ACTGCGAGCG ATGGCATGCA GTACGTGTTG
AAGGTCATGC GTCCGCACTG CGATCCGACA CTCGTTGATC TCCAGTGTGC AGCGTTGGAA
CACCTCGCCG TTGTTGCGCC CGATCTTCCT GTTCCACGCT TGATCCGCAC CACCACCGGT
GAGCGGTTTG TCGCCGAGCA AGATCGGCTC ATCTGGCTGA TCACCGCACT GCCGGGCGAA
CACTATGCTA CCTTCCGCCC TCGCACCACC ACCTTGCACA CCGAAGTAGG CCGGCAAGCT
GCTCGGCTCG ATCAGGCACT GGGTAGCTTT ACCCATCCCA CCCTGCACCG ACCACTCAAG
TGGAATCTGC TTCAAGCTGA TTGGGCTACC ACCGCCCTCG CTGCCATCAC CGACCCCACC
CAACGCGACC TTGCCGCTAC TGCACTTGCC GCCTACACCG ATCTGCGTCC AAACCTGCTG
CAACTACCAC TTACGATTAT TCACAACGAC CTTAACGACT ACAACCTGCT CGTCATGGCC
GATAGCTATG GTCATTTAAC CCTGAGTGGT ATTCTTGACT TCGGCGATTT GTGCGCTGCA
CCGCGCGTCT GCGAACTCGC GATCGCGGCG GCGTATGCCC TGTTTGACCA ACCCGATCCC
ATACGGACAC TAGGCGAGTT GGTTGCCGGG TACCATAGCG TCTGGCCGCT GACCGCTGCC
GAGATCGATC TGATCTGGCC GCTGATCCGC ACCCGATTGG CGGTAAGCGT GGTCAACTCG
GCGCTGATGA AACAACAACG CCCCGATGAC CCCTACGTAA CGATCTCGGA AGCACCGGCG
TGGCGCTTGC TGGCTGCAAC CGCCCACAAT AACCCCGCAC TAGTTGCTGC CCGTTTACGC
GCTATTTGCA ACTTGCCAAT ATCCGACGCT GCCGAACGGG TTATGGTCTG GCTTGATGCG
CAACGTGGTC GCTTTGCACC GGTGATCGGC TGCGATCTAG CCACGGTTCC GGTGATCAGC
CTTGCGGTCG GTGAAGCACC GCTGCCGCAG GATCCGTTCC AGCTCACGAC AACAGAAGCT
GAAGCACTGA CCGGTAGCAA CACCGGTGTA TGCATTGGCC GTTACGGTGA GCCACGCCTT
ATCTATACCG CTGCCACATT CTGGACCGCT CCCCAACCGC TGGCCGAACG CCGCACCATC
CATCTCGGCG TCGATCTGTT TGCTCCGCCC GGCACTCCGG TCTGCGCTCC ACTCGATGGC
GAAGTCGTGG CCGTCGAACA TCTCTCCGAC CGACTCGATT ACGGTGGTTT GGTCATCCTC
AAGCACTACA CTCCAAACGG TGATCCGTTC TACACCCTCT ACGGCCATCT CGAACCAATC
TGCTTCAAAC GTCTCACCGT CGGCCAGCAC ATTGCTGCCG GGCAACTGTT TGCCGCCCTC
GGTATGAACA GCAACAACGG CGGATGGCAA CCACATCTCC ACTTTCAACT GATCCTACTC
TACGACGCCG TTGCCGGACA CTGGCCCGGT GTTGCGGCTG CTAGTGAATG GACCTGGTGG
CACGCGGTTT GCCCTAATCC GGCAGCCCTG CTGAACCTCC CGGACGAGCG CGTCGCCTAC
CAACCGCTCG ATCAGGTCAG TCTCCTACAT GAACGACGCA CACACTTTGC CGGCAATTTG
CGCTTGAGTT ACCGCGAACC ATGCACCTTC TTACGGGGCT GGCGTCACTA TCTATTCGAT
GAGTACGGCC AAACGTATCT CGACGCCTAC AACAATGTTC CTCACGTCGG CCATGCGCAC
CCGCGCATCC AAGCAGTCGC AGCGCAACAA CTTCGTCTGC TCAACACCAA CACACGCTAT
CTACACCCGG CTCAGATCGC ATTTGCCCAC GAACTGCTCG CCAAACTGCC TCCATCATTG
AGTGTCTGTT TCTTCGTCAA CTCCGGCTCC GAAGCCAACG AACTGGCGTT ACGGTTAGCG
CGCACCTACA CCGGTGGGCA CGATATGATT ACCATCGACC ACGGCTATCA TGGCCACACC
ACCGGTGCAA TCGCCATCTC GGCCTACAAG TTCAACCACC CTGCCGGCAA CGGTAAGCCC
GACTGGGTAG AGGTAGTCAT GGCTCCCGAC CCCTACCGTG GCCCTTACGG CCACGATGGC
CCCCGCTACG CCGCTGAAGT TGATCAAGCC ATCGAACGGA TCGCTACACG CGGTGGCAAA
CTCGCCGGCT TCATCGCCGA GACCTTCCCC AGCGTCGCCG GACAGATTAT TCCTCCACCC
GGCTACCTCG CTGCAGTGTA TCAACGTATT CGCGCCGCCG GCGGTGTTTG CATTGCCGAC
GAAGTGCAGA CCGGATTGGG ACGACTCGGT ACGCACTATT GGGCGTTCGA GAGCCAAGGG
GTAGTGCCTG ATATCGTGGT ACTCGGCAAA CCGCTCGGCA ATGGGCACCC TATCGGAGCG
GTGATCACAA CCGTTGAGAT CGCCCGTGCC TTCGATAACG GCCTCGAATT CTTCTCGACG
TTTGGCGGCA GTACGCTCTC TTGTGTCATC GGCCGCGAAG TGCTGCGCAT CATCGACGAA
GAAGGCCTGA TGGATAACGC TCACTGCGTT GGTCAAGTCC TGCTCACCGG TCTGCGTGAC
CTACAACACC GCCATCCCGT TATTAGTGAT GTGCGGGGAA TGGGCCTATT TATCGGCGTT
GAGTTGGTTA CCGATCGTAC AACACGCACA CCAGCAACCG CGGCTGCTCG TTATGTGCGA
GAGCGTCTAC GCGCCGAACG CATCCTCATC GGCACCGAAG GTCCAGCCGA CAACGTGTTG
AAGATTCGCC CACCACTTAC CTTCGATCTC GCCGCAACCA CCGTCTTTCT GGAGCGACTC
GATGCTATTC TCGGCGAGAG TTTTATTGCA CGACAGGGAG GTTAA
 
Protein sequence
MPWSTALARH YHIHGTVTSL PGEYDLNLLV TASDGMQYVL KVMRPHCDPT LVDLQCAALE 
HLAVVAPDLP VPRLIRTTTG ERFVAEQDRL IWLITALPGE HYATFRPRTT TLHTEVGRQA
ARLDQALGSF THPTLHRPLK WNLLQADWAT TALAAITDPT QRDLAATALA AYTDLRPNLL
QLPLTIIHND LNDYNLLVMA DSYGHLTLSG ILDFGDLCAA PRVCELAIAA AYALFDQPDP
IRTLGELVAG YHSVWPLTAA EIDLIWPLIR TRLAVSVVNS ALMKQQRPDD PYVTISEAPA
WRLLAATAHN NPALVAARLR AICNLPISDA AERVMVWLDA QRGRFAPVIG CDLATVPVIS
LAVGEAPLPQ DPFQLTTTEA EALTGSNTGV CIGRYGEPRL IYTAATFWTA PQPLAERRTI
HLGVDLFAPP GTPVCAPLDG EVVAVEHLSD RLDYGGLVIL KHYTPNGDPF YTLYGHLEPI
CFKRLTVGQH IAAGQLFAAL GMNSNNGGWQ PHLHFQLILL YDAVAGHWPG VAAASEWTWW
HAVCPNPAAL LNLPDERVAY QPLDQVSLLH ERRTHFAGNL RLSYREPCTF LRGWRHYLFD
EYGQTYLDAY NNVPHVGHAH PRIQAVAAQQ LRLLNTNTRY LHPAQIAFAH ELLAKLPPSL
SVCFFVNSGS EANELALRLA RTYTGGHDMI TIDHGYHGHT TGAIAISAYK FNHPAGNGKP
DWVEVVMAPD PYRGPYGHDG PRYAAEVDQA IERIATRGGK LAGFIAETFP SVAGQIIPPP
GYLAAVYQRI RAAGGVCIAD EVQTGLGRLG THYWAFESQG VVPDIVVLGK PLGNGHPIGA
VITTVEIARA FDNGLEFFST FGGSTLSCVI GREVLRIIDE EGLMDNAHCV GQVLLTGLRD
LQHRHPVISD VRGMGLFIGV ELVTDRTTRT PATAAARYVR ERLRAERILI GTEGPADNVL
KIRPPLTFDL AATTVFLERL DAILGESFIA RQGG