Gene Avin_20500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20500 
Symbol 
ID7764124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2041872 
End bp2043071 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID643804947 
ProductPhage integrase 
Protein accessionYP_002799228 
Protein GI226944155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGAA AAACCATCAC TGGCCTCTAC GAGAGAAATG GAATCTGGCA TGTCGACAAG 
GTCGTCAGAG GTCAGCGACT TCAAGAAAGC ACTGGAACAG GCAACCGGGA GGAAGCAGAA
CAGTACCTGA TACACCGGCT CGAGAAGCTG CGAGAGGAGA AGGTCTACGG TATCCGCCGG
ATCAGGAGCT GGCGGGAAGC CGCTACCCGC TACCTGGTGG AGTACAAGGA CATGCCGTCG
ATCGGCTTGG CCGCCACCTA CCTGGAGCAG CTGGATCCCT ACATCGGCGA CCTTCCGATC
ACACACGTCG ATGACGAGTC GCTTGCTCCA TACATCAGGG ACAAGCTGAA GCCGGGCAGG
ACATCGACCG GCAAGGTGAA GCCTGGTGTA ACCCCAAGGA CGGTGAACAT CGCCCTGGAG
AAGGTCATCC GCGTTCTCAA CCTGTGCGCC CGGAAATGGC GCGATGAAGA GAAACGGCCC
TGGCTGGACA CGGTGCCGAT GATCAGCAAG CTGGACGAGA AGCGATCGAG GCGGACGCCC
TACCCGCTTT CGTGGGAAGA ACAGTCGCTG CTGTTCTCGG AACTGCCGGA CCACCTGCGC
CGCATGGCGC TCTACAAGGT CAACTGCGGT TCTCGGGAGC AGGAAGTGGT CAAGCTGAGG
TGGGACTGGG AGATACCGGT ACCGGAACTC GACACCAGCG TGTTCCTGAT TCCTTCGGAT
TTTGGAGGCA GGGACAAGGG ATCGGGCGTG AAGAACGGAG AGGAACGGCT GGTCGTGCTG
AACACTGTGG CCAAGTCGGT CATCGAGGGG CAGCGTGGCC TGGATCCGAC CTGGGTATTC
CCGTACGGGA TGCCCGACAG GAACGGCAAG GCGACACCGG TTCATCGGAT GAACGATTCC
GCCTGGAAGA AGGCGCGGGT CAGGGCGGCG AAGAAGTACC AGGAACGCTT CCTGAGACCG
GCGCCGAAGG GATTCGCCTC GATCCGCGTG CACGACCTGA AGCACACCTT CGGAAGAAGG
CTCCGGGCAG CCGGTGTAAC CGAGGAAGAC AGGCGGGCCC TGCTAGGCCA CAAGAACGGC
AGCATCACCA GTCACTACTC AGCGGCGGAG CTGGGAAAAC TGATCGATGA GGCCAACAAG
ATATCGGCGA CGGACTCACG AGGGCCGGCG CTGACGATAC TGAGGAGAAA GGCAGGATGA
 
Protein sequence
MARKTITGLY ERNGIWHVDK VVRGQRLQES TGTGNREEAE QYLIHRLEKL REEKVYGIRR 
IRSWREAATR YLVEYKDMPS IGLAATYLEQ LDPYIGDLPI THVDDESLAP YIRDKLKPGR
TSTGKVKPGV TPRTVNIALE KVIRVLNLCA RKWRDEEKRP WLDTVPMISK LDEKRSRRTP
YPLSWEEQSL LFSELPDHLR RMALYKVNCG SREQEVVKLR WDWEIPVPEL DTSVFLIPSD
FGGRDKGSGV KNGEERLVVL NTVAKSVIEG QRGLDPTWVF PYGMPDRNGK ATPVHRMNDS
AWKKARVRAA KKYQERFLRP APKGFASIRV HDLKHTFGRR LRAAGVTEED RRALLGHKNG
SITSHYSAAE LGKLIDEANK ISATDSRGPA LTILRRKAG