Gene Avin_50310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50310 
Symbol 
ID7763881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5099570 
End bp5101141 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID643807861 
ProductABC transporter ATP-binding protein 
Protein accessionYP_002802095 
Protein GI226947022 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.433727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGATCG TGAACATGCT CAGCACTGCA CCCAGCACTC ATGGCGTCAT GACCCAACTG 
CCGACAGCCA TGGCTCCCGA GTACCTGCTC GAAATCGTCA ATGTCACCAA GAGCTTTCCC
GGCGTGGTGG CGCTTTCCGA TGTCCAGTTG CGCGTGCGTC CCGGCACCGT GCTGGCGCTG
ATGGGGGAGA ACGGCGCGGG CAAGTCGACG CTGATGAAGA TCATCGCCGG CATCCACCAG
CCGGACACCG GCGAGCTGCG CCTGCGCGGC CAGGCGGTCA GCTTCGAGAC GCCGCTCGCC
GCCCTGCAGG CCGGCATCGC GATGATCCAC CAGGAACTCA ACCTGATGCC CTTCATGAGC
ATCGCCGAGA ACATCTGGCT CGGCCGCGAG CCGCTCAACG CCATGCGCAT GGTCGACCAC
CGCAGGATGC ACCGGCAGAC CAGGGAACTG TTCGAGCGCC TGCGCATCGA CCTCGACCCC
GAGGCGCCGG TCGGCAGCCT GAGCATCGCC GGGCGGCAGA TGGTGGAGAT CGCCAAGGCG
GTGTCCTACG ACAGCGACGT GCTGATCATG GACGAGCCGA CCTCGGCGAT CACCGACAAG
GAGGTCGCGC ACCTGTTCTC GATCATCGCC GACCTCAAGG CCGCGGGTAA GGGCATCATC
TACATCACCC ACAAGATGGA CGAGGTGTTC GCCATCGCCG ACGAGGTGGC GGTGTTCCGC
GACGGCGCCT ACATCGGCCT GCAGAGCGCC GACAGCATGG ACGGCGACGG GCTGATCTCG
ATGATGGTCG GCCGCGAACT CACCCAGCTC TTCCCCGAGC GCCGCGCGCC GCGCGACCAG
GTGGTGCTCT CGGTGCGCGA CCTGGGCCTG GAGGGCGTGT TCCAGGGCGT GTCCTTCGAC
CTGCGCGCCG GCGAGGTGCT GGGCATCGCC GGGCTGATGG GCGCCGGGCG CACCAACGTG
GCGGAAACCC TGTTCGGCGT CACCCCGGCC AGCCAGGGCG AAATCCGCAT CGACGGCGAG
CCGGTGAACA TGAACGATCC CTGCCTGGCG ATCCGCAAGG GCCTGGCGCT GCTCACCGAG
GACCGCAAGG ACACCGGCAT CTTCGCCTGC CTGTCGGTGC AGGAGAACAT GGAGGTCACG
GTGCTGCCCA ACTTCGCCAG CCGCGGCTTC GTGCAGCGCC AGCGCCTGCG CGAGCTGTGC
GAGGAGATGC GCCGCAAGCT GCGCGTCAAG ACCCCCTCGC TGGAGCAGTG CATCGCCAAC
CTGTCCGGCG GCAACCAGCA GAAGGCCCTG CTGGCGCGCT GGCTGATGAC CCAGCCGCGC
GTGCTGATCC TCGACGAGCC CACCCGCGGC ATCGACGTCG GCGCCAAGGC CGAGATCTAC
AAGCTGATCG CCGAACTGGC CGCCGAAGGC ATGGCGGTGA TCATGATCTC GTCCGAACTG
CCGGAAGTGC TGGGCATGAG CGACCGGGTC ATGGTCATGC ACGAGGGCGC GGTGACCGGC
ATCCTCGAGC GCGACGAAGC CACCCAGGAG CGGGTGATGC AACTGGCTTC GGCGACCCCT
TCCGTTCACT GA
 
Protein sequence
MEIVNMLSTA PSTHGVMTQL PTAMAPEYLL EIVNVTKSFP GVVALSDVQL RVRPGTVLAL 
MGENGAGKST LMKIIAGIHQ PDTGELRLRG QAVSFETPLA ALQAGIAMIH QELNLMPFMS
IAENIWLGRE PLNAMRMVDH RRMHRQTREL FERLRIDLDP EAPVGSLSIA GRQMVEIAKA
VSYDSDVLIM DEPTSAITDK EVAHLFSIIA DLKAAGKGII YITHKMDEVF AIADEVAVFR
DGAYIGLQSA DSMDGDGLIS MMVGRELTQL FPERRAPRDQ VVLSVRDLGL EGVFQGVSFD
LRAGEVLGIA GLMGAGRTNV AETLFGVTPA SQGEIRIDGE PVNMNDPCLA IRKGLALLTE
DRKDTGIFAC LSVQENMEVT VLPNFASRGF VQRQRLRELC EEMRRKLRVK TPSLEQCIAN
LSGGNQQKAL LARWLMTQPR VLILDEPTRG IDVGAKAEIY KLIAELAAEG MAVIMISSEL
PEVLGMSDRV MVMHEGAVTG ILERDEATQE RVMQLASATP SVH