Gene Avin_49670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_49670 
Symbol 
ID7763820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5030828 
End bp5033953 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content59% 
IMG OID643807801 
Producthypothetical protein 
Protein accessionYP_002802035 
Protein GI226946962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGGTG GAGTGAACGT GCCCCGTTTG GACAATCAAC AGGTCGAGTA TCTATGGGCT 
GCTGGCGATC ATGAGCCGGA TTGGAGGCCA GAGCTGCGGG AGCACCTGAA AACGATAGCT
GAGGCGTCTG GAGTGCCCCT GGGGCAGTGG TTATTGGGAA AGTGTTCCAT GCGCGAGGTG
TCGGCCGAGC ACGTGCGTGA CCTGGAGGCG CAGCTTGGTA CGAATCACGC GCTACGGAAT
ATGCTGCGCA CCATCACCCA ACGACTCAGG CGGCAGATTC CAGGTTTTCC ATTGGCCCGT
ATCGCCATTG CCGTCGAACG CCCAGCGAAC CCGATCAACC CCGACATCTA CACGGCTGAA
CGACTACGCA GAAGTCGCCA ACTGCTCAAT GTACTGCAGC AGGGATTGAG GTTGGATCTC
GATTCGTTCC GCCCAGAGGA GCGCATCGGT CTGCTATTGA TGAGCGCGGC TTATGGCGGC
GGCCTGATGG ACATTGCTCA ACTGAATGCG CTGGTCGAGG TGTCCCTGGA GCGAATCGAG
TGGATCGCAG GCATTCCGGA GCTGAGATTG CCGCTCTCCA TCCGCGGCAA GGTGCAGGCG
GAACACCGGC AATGGTTCCC GGACCCGGCC ACGCTGGCTC TGTTGACCCG CTGTTCGGAC
GATATGCGAG CAATGGGCGC TCGATTGAGG CGCAGAGAAA CTGTTCTCAG GTGTATTCGA
GCGTTTCTCG GAAGGAGCGG TGTACCGAGT CGGGATTTGC CGACCAGCCT GACGGAGCTG
CTGGATTTGC TGCGGATGCA GATGCAATTG CGCCTGCCTC AGATTCTGGT GAATTTCGCA
TGTCGACATG GGTTCGTATC GCAGTCGCTG AGACCATCGT CCTGGGGGGA AATGTTCGGC
TATCCAGGCT TGGAAGATCC GGTTGGCACC TATGGAGATA GTGTCAGATC GGACGAGTAC
GGAGAGGAAA AAACCGATAC TCCCGATTGG GTGCTGGATC TGTGTCGGCA GATCCGGGCA
GGTGACCCTG TTGATCCATC CCCAGCGACT GAGGATTCTG AATCACTTGA GGTGCTTATC
AGAGAGTGGG CTGCCTATCT GGTAGGTGGA TCGTCCGCAT ATGGTCACGA CATCGGGCGC
AGTAGCATCA CTCGGTACGC CCGACTGCTT GGGGAGGCGT TGGCGTCGCA ATTGGATGGC
CAGAGTGTTT TCCAGATGGA GCCCGATGCA CTGGAGATTG TCTATGAGAC GGTCCTCGAT
GCCCAGATAA CCGATAGCAA GCGGCGCACC TTGGCCAAGG CGATTCATGA GTTCCACGCA
TTTCTGGGGC GTCGCTATCA CTATCCACCG ATCAGCCCGT ACTCGGTATT GGGCATCGGC
AGGGATGTGG CTAGCGTTGA TGCCCGGATC CTCTCCGAGG ACCAATATCA GGCCGTTTTG
CGCGCCTTGG ACACCAGCGG GCTGGAATTA CGGACCCCGC GTCTGGTCAC GGCTGCCAAG
CTGTTTCTGA TCCTGGGATT CAGGTTGGGC TTGCGGCGCA ATGAGGCCCT GAAGCTGCGG
CTCAGTGATC TACATTTGCC CGAGTTATCG AGTGACGCCC GCGAACGTAT TCGCGGGCGT
CATCCGGAGA TGCGGATCCT CTCCAACCAG GAGCTGGCAG GGTTGGAGCT GCCGGTCGAC
CTGCTTGTAC GGCCACATGC ACAGCGCGGC CTGAAAACCC AGAACTCGGT TCGGCGGTTA
CCGCTCCGCC TATTGTTGGA GCCCGAAGAA CTGGAACTGT TGATGGTCTG GTACCAGCAA
CGGCAAGCAG AAGAGACACG GGCGCCTTCA TCGGAGTTTC TATTCTGCAT CCCGGAGCTG
AGAACCCAGT GGGTCAGCGA AAGCACGCTG TTGCCGGCAT TGCATGCCTG CATGCGTGCA
GTCACAGGTT CCGAGGTCAT TCACTATCAC CACCTGAGGC ATTCCTGCGC CACTTGGCAG
ATGCTCAAAC TGATGGGAAC CATTACCGAC TCAGCGCCGG AGTTGATATT CCGTGATCTG
CCCCTGACCA CTCGATGGCT CAGCGACAAT GCCAGGCAGC GTGAAGCGCT GATATCTGCC
AACGGTGGAC CCACACGGCG TATCGTCCAT ATCGTCAGTG CCTTGCTCGG TCACGGTAGC
CCCAAAACTT CGCTGCTGCA CTACATCCAC AGCCTACCTC TGGTAATGGC TCAGGCCTGG
CAGTGGAACC CCAGAGTCTG GCTTTTCAGT GCCCATAACG TCGCCTCTAT CGCCAAGGTC
AGCCTGCCCA CGACGGAGGC CAGTTCCGTT GGTGGTCCGG AGCATCTGTT GCGGGTCATC
GGCCGGATCA GGTCGCTCAA GGCCAAAAGA CGGCCTCGAC GGACTGCGGT TTGTTTCGCC
GTTCAGCAGG TCGAAAACAA CTGGGCGATC GAGCGGATTC GCCGGATCGA GTCGATGCTG
GCATACGCAT CTTATGTCGA GAGTAGCGGT CGGCAGATCA ATCTGGAGTG GCTGGAGTTC
GCGACAGAAG AGCGGAGCAT GATGCTGGAT CGAGCTCAGT ATATCCGCAG CTTGACGCAG
AGATCCCAGC CGGAGGCTGG GGGCAAGCAC CGGCTGCGAG CCTCTATGCA GGCTGAGACG
TCATCGCTCA TACCAATGCC TCCTCGACAT GGTGGAAGAG ATGCCGTAGC AGGGTATGCA
GAACGTCTCT ACGAGTTACT GGATGGAGCG GAGAGCGAGC GCGCCAATCG AGCGATAGAC
GATTTCGTCG AACGCTGTTG GGCAACGGAA ACGACGCTGC GCTTTTACCG TGATTGCGAC
GAGGAGCATA CACGGGACTA CCTGTGGCTG CTCACAGCCA TCGGCGTTCC CGCCCAATCC
ATCGAGTTGA TTATCTACGA CACACGAAAG CCCAGAGCCG CCAAGTCATA TTGGCGTCAG
CAGCTCGGGA ATATTCGTCG GCCGATTAGC CAGCATGCGC CCGAAAATCC GGATGCCGAA
AATACCCATC TTGGAATTCG GGCCACGCTG GCGCTAGAGG AGGGGCGGCA GCAGAACCGA
CATTCGGGGG CAGCGCTGCG CTACCTATTC CTGATGGCCT CCATCGACTG GCATTTCCGG
ACATGA
 
Protein sequence
MGGGVNVPRL DNQQVEYLWA AGDHEPDWRP ELREHLKTIA EASGVPLGQW LLGKCSMREV 
SAEHVRDLEA QLGTNHALRN MLRTITQRLR RQIPGFPLAR IAIAVERPAN PINPDIYTAE
RLRRSRQLLN VLQQGLRLDL DSFRPEERIG LLLMSAAYGG GLMDIAQLNA LVEVSLERIE
WIAGIPELRL PLSIRGKVQA EHRQWFPDPA TLALLTRCSD DMRAMGARLR RRETVLRCIR
AFLGRSGVPS RDLPTSLTEL LDLLRMQMQL RLPQILVNFA CRHGFVSQSL RPSSWGEMFG
YPGLEDPVGT YGDSVRSDEY GEEKTDTPDW VLDLCRQIRA GDPVDPSPAT EDSESLEVLI
REWAAYLVGG SSAYGHDIGR SSITRYARLL GEALASQLDG QSVFQMEPDA LEIVYETVLD
AQITDSKRRT LAKAIHEFHA FLGRRYHYPP ISPYSVLGIG RDVASVDARI LSEDQYQAVL
RALDTSGLEL RTPRLVTAAK LFLILGFRLG LRRNEALKLR LSDLHLPELS SDARERIRGR
HPEMRILSNQ ELAGLELPVD LLVRPHAQRG LKTQNSVRRL PLRLLLEPEE LELLMVWYQQ
RQAEETRAPS SEFLFCIPEL RTQWVSESTL LPALHACMRA VTGSEVIHYH HLRHSCATWQ
MLKLMGTITD SAPELIFRDL PLTTRWLSDN ARQREALISA NGGPTRRIVH IVSALLGHGS
PKTSLLHYIH SLPLVMAQAW QWNPRVWLFS AHNVASIAKV SLPTTEASSV GGPEHLLRVI
GRIRSLKAKR RPRRTAVCFA VQQVENNWAI ERIRRIESML AYASYVESSG RQINLEWLEF
ATEERSMMLD RAQYIRSLTQ RSQPEAGGKH RLRASMQAET SSLIPMPPRH GGRDAVAGYA
ERLYELLDGA ESERANRAID DFVERCWATE TTLRFYRDCD EEHTRDYLWL LTAIGVPAQS
IELIIYDTRK PRAAKSYWRQ QLGNIRRPIS QHAPENPDAE NTHLGIRATL ALEEGRQQNR
HSGAALRYLF LMASIDWHFR T