Gene Avin_52350 details


Gene Information

Locus tag: Avin_52350
Symbol:
ID: 7764070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5349866
End bp: 5351548
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 53%
IMG OID: 643808048
Product: adenine specific DNA methylase N-4/N-6
Protein accession: YP_002802282
Protein GI: 226947209
COG category: [L] Replication, recombination and repair
COG ID: [COG2189] Adenine specific DNA methylase Mod
TIGRFAM ID:
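
The lengths reported above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. The following is a minimal Python sketch of the arithmetic; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5349866, 5351548)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both results match the Gene Length and Protein Length fields, which is what one would expect for an unspliced CDS ending in a stop codon.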


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC AAAAACTCGA ACTCACATGG GTTGGAAAGG ATAAGCGGCC CAAGCTAGAG 
CCGCGCATCC TACTTGAAGA TCCCGAAAAA TCTTACCATG CCAAGCAGCG TGTTTCAGAG
AACGACTTCT TTGATAATCA GCTGATTTTC GGAGATAACC TGCTGGCGTT GAAGGCGCTG
GAGCAGGAGT TTTCTGGAAA GGTAAAGTGC GTTTTTATTG ACCCGCCTTA CAACACTGGG
AGTGCCTTCA CGCATTATGA CGACGGGTTG GAGCACTCCA TCTGGCTGGG ACTAATGCGG
GATCGACTGG AGATCATCAA GCGGCTATTG TCGGACGATG GTTCATTATG GATCACCATT
GACGATAATG AATGTCATTA TCTCAAGGTG CTATGCGACG AAGTATTTGG AAGAAATAAC
TTTGTTAGTA ATTTGATTTG GGAGAAAGCG GATTCACCTA GGAATTCTGC CCGTCAATTT
TCGACCGATC ATGACCATAT TTTAATTTTT TCCAAGAACC CTGATTGGAT TCCTAAAAAA
CTTCAACGCA CGGAACAAGC CAACTCCATA TATTCGAACC CAGATAACGA TCCACGTGGC
CCTTGGCTTC CCGGCGACCC CTACGCAAAC AAGCCGTACT CCAAAGGCCA ATACACAGTT
ACTGGGCCTA CAGGGAGGGA TTTCTCACCA CCTCCTGGAA GATATTGGCG TATTTCAGAG
GAAAAACTTC AAGAGTTAAA CACCGATGGC AGAATTTGGT GGGGGCCAAA TGGATCTGCT
CGACCAAGCA TTAAACGATA TCTTTCTGAG GTAGGGGATC TTGTCCCAAG AACCTTATGG
TCCAAAGAGG ATGTTGGAAG CAACCGTACA TCCAAGAATG AAATGCGGCT CCTTTTTCCA
GGAGATAGCT CCTTCGATAC GCCCAAACCT GAGCGCCTCA TAGAGCGAGT ATTGAATATT
GCCACCAGTC CCGGCGACCT AGTCCTTGAC TCATTCGCCG GTTCCGGCAC CACCGGCGCA
GTTGCCCACA AAATGGGCCG CCGCTGGATC ATGGTCGAAC TCGGCGAGCA CTGCCATACC
CACATTATTC CACGTCTGAA AAAGGTCATC GACGGCGAAG ACCCGGGCGG CATCACCAAG
GCAGTGGACT GGCAAGGTGG CGGTGGCTTC CGCTACTACC GTCTCGCCCC TAGCCTGATC
GTGGAGGATC GCTGGGGCAA TCCGGTCATC AACCCGGAAT ATAACGCCGC TCAATTGGCC
GAGGCATTGT GCAAGTTGGA AGGTTTTGCC TATGCGCCAT CGGAAACCCG CTGGTGGCAG
CAGGGACATT CCAGTGAACG GGACTTTCTC TACATCACCA CGAAAAACCT GTCTGCCGCC
CAGTTGCAGG CTTTGTCGGA TGAAGTGGGC ACCGAACAAA GCCTGCTGGT GTGCTGCTCG
GCCTTCCACG GTATCAGCGC AGCAGCGGCC GCTGCGCGCT GGCCGAACCT GACGTTGAAA
AAGATTCCGA AGATGGTACT GGCCCGTTGC GAATGGGGCC ATGACGACTA CAGCCTGAAT
GTGGCGAACC TGCCGCTGGC CGAGTCGTCG CCGCCAGCAC CTGCTGCGAA GGCAGCCAAG
TCCGGCAGAA AGTCCAGCGA TAGTCGGACG ACGGACATGT TTGGCGATGG AGGGGACGCC
TGA
 
Protein sequence
MSKQKLELTW VGKDKRPKLE PRILLEDPEK SYHAKQRVSE NDFFDNQLIF GDNLLALKAL 
EQEFSGKVKC VFIDPPYNTG SAFTHYDDGL EHSIWLGLMR DRLEIIKRLL SDDGSLWITI
DDNECHYLKV LCDEVFGRNN FVSNLIWEKA DSPRNSARQF STDHDHILIF SKNPDWIPKK
LQRTEQANSI YSNPDNDPRG PWLPGDPYAN KPYSKGQYTV TGPTGRDFSP PPGRYWRISE
EKLQELNTDG RIWWGPNGSA RPSIKRYLSE VGDLVPRTLW SKEDVGSNRT SKNEMRLLFP
GDSSFDTPKP ERLIERVLNI ATSPGDLVLD SFAGSGTTGA VAHKMGRRWI MVELGEHCHT
HIIPRLKKVI DGEDPGGITK AVDWQGGGGF RYYRLAPSLI VEDRWGNPVI NPEYNAAQLA
EALCKLEGFA YAPSETRWWQ QGHSSERDFL YITTKNLSAA QLQALSDEVG TEQSLLVCCS
AFHGISAAAA AARWPNLTLK KIPKMVLARC EWGHDDYSLN VANLPLAESS PPAPAAKAAK
SGRKSSDSRT TDMFGDGGDA
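
The relationship between the two sequence blocks can be checked with a short script. The sketch below strips the space-separated 10-character groups, computes GC content, and translates a coding sequence. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the code is a plain-Python illustration rather than the tool that produced this record.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11, bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence block and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTAAAC AAAAACTCGA ACTCACATGG"))  # MSKQKLELTW
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the reported GC content.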