Gene Avin_36050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_36050 
Symbol 
ID7762499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3672551 
End bp3674056 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content53% 
IMG OID643806472 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_002800727 
Protein GI226945654 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATC GGAGGAGAAC AGCCATGGAA AGCAATGAGA AATTGGCGAA GTGTAAGCTC 
GAAAGGCTGG CATCCGGAGC GATTCCGAAG GTACTCGAAT TATGCTCGGG TTGCGGTGGG
CTGTCGCTAG GTTTGAAAAC TGCAGGCTTT GAGCTTGCAG CTCATGTAGA GAGCAACGAT
GAGGCCAATG CCACATACGC ACTGAATTTC GCTCCGGAAA ATCCTGCGCA GACGAAGCAG
TGGGCTATCT CCCGTGACAT GGTGGCTCAA TCAATGAGTG ATCTCATCAC GGATTTCGGA
TTGGCCGGAG GCCCGCGCGA AGCTTTCGAT GTCCTGGCTG CAGGGCTGCC ATGTCAGGCA
TTTGCCCGTA TAGGAAGATC CAAGCTTCGA TCAGTGACGG GAGATGAGGA TGCATTCAAG
AACGATCCAC GCGCATCCCT TTATCGCCGC TTTCTGGAAA TTGTGGACGA AACTCGTCCT
CTGGCCATTC TCGTTGAGAA TGTTCCGGAT ATCATGAATT TTGGCGGCCA CAATGTACCT
GAGGAAATCG CAGAAGGGCT CAGGGTTCGT GGCTATGTTA CTCGCTACAC CCTGCTTAAT
GCAGCGTTCT ATGGTGTGCC TCAACTTAGG GAAAGACTCT TTCTTGTTGC TGTTGACGCC
ACTCTTGATG TGATTCCGCA GTTCCCTTCG CCTACCCACT TCATGGAGTT ACCGCGTGGC
TATGAAAGTA GCCGTGCTGT AGCTCTCAAA CACGTCAAGG ATGTGGGTTC GCACTTCTCG
CCCATCCCTT CCCCTGCAGG CGGGCTGCCT TCTTCGATAG GTACCGAATC GGCATTGGCT
GACCTTCCCT TCATTTCTGA CCATCTGAGG GATACGGCGA TCATCAGGAA GCGGAAGGTT
GCAGACAAGC TGCCGTATCG TGAAGGCATT ACACCTTCCA CGTATGCTCA CCTGATGCGC
GACTGGCCCG GATTTTCTGC TTCCGAAAAT GTGAGCGGCA ATGTCGTGCG CATTACCACG
CGTGACTTTC CGATTTTTGG TCGCATGCCC CGTGGAGCAG ACTATCCTGT TGCACTCCGC
ATTGCACAGC AACTCTTGGA AGAAAAACTA CAGCGGGAAA ATTTTCCGCC ACGGCCAGGA
ACTATTCGTT ACAAGGCTCT GGAGAAGGCG ACGATTCCTC CTTACGACGC AAGCAAGTTT
CCCAACAAGT GGTGGAAGCT TGATCCGGAC GCTCCCTCTC GAACCCTGAC GGCCCACCTT
GGCAAGGACA CGTATTCGCA CATCCACTAT GACGGACGCC AGAAGCGCAT GATATCTGTC
CGCGAAGCAG CACGACTTCA GTCATTTCCC GATGGATTCG AGTTTGCTGG AGCCATGAAT
GCGTCTTTCC GTCAAATTGG CAATGCGGTT CCTCCGATGC TTGCACTCGC CGTCAGCAAG
GCACTTATGG AGACTATTGA GCAGGCGATT GCAGGACGGG ATTCTGCCAA TCGTCGAGTA
GCCTGA
 
Protein sequence
MKHRRRTAME SNEKLAKCKL ERLASGAIPK VLELCSGCGG LSLGLKTAGF ELAAHVESND 
EANATYALNF APENPAQTKQ WAISRDMVAQ SMSDLITDFG LAGGPREAFD VLAAGLPCQA
FARIGRSKLR SVTGDEDAFK NDPRASLYRR FLEIVDETRP LAILVENVPD IMNFGGHNVP
EEIAEGLRVR GYVTRYTLLN AAFYGVPQLR ERLFLVAVDA TLDVIPQFPS PTHFMELPRG
YESSRAVALK HVKDVGSHFS PIPSPAGGLP SSIGTESALA DLPFISDHLR DTAIIRKRKV
ADKLPYREGI TPSTYAHLMR DWPGFSASEN VSGNVVRITT RDFPIFGRMP RGADYPVALR
IAQQLLEEKL QRENFPPRPG TIRYKALEKA TIPPYDASKF PNKWWKLDPD APSRTLTAHL
GKDTYSHIHY DGRQKRMISV REAARLQSFP DGFEFAGAMN ASFRQIGNAV PPMLALAVSK
ALMETIEQAI AGRDSANRRV A