Gene Avin_31110 details


Gene Information

Locus tag: Avin_31110
Symbol:
ID: 7762011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3216547
End bp: 3217629
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 68%
IMG OID: 643805986
Product: ABC transporter substrate binding protein
Protein accession: YP_002800250
Protein GI: 226945177
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
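
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3216547
end_bp = 3217629

# Inclusive coordinates: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1083 360"
```

Under these assumptions the result agrees with the listed gene length of 1083 bp and protein length of 360 aa.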


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.24737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTCG ACTCCAACCT CTCCCGCCGT CGCCTGCTCG GCCTGGCCGG CACCGCCGCC 
GCGGCCGCCG CCGTCGGCCG CTTCGCCTGG GCCGCCGATC CGCACGCCCA TGCCGCCCAT
GCCGCCCATG GCGCCCATGG CGACGACGCA CAGAATTTCC TGCGCGACGC CAAGAGCTGG
GCACTGCCGG CGCCGCGCAA GCTCAAGCTG GCGACCAACC TCAACGCCAT CTGCCTGGCC
CCGGTGGCGG TGGCCGACAG CCAGGGCTTC TTCCGCAACC ACAACCTGGA GGTCGAGTTC
GTCAACTTCG GCAATTCCAC CGAGGTGCTG CTCGAATCCC TAGCCACCGG CAAGGCGGAT
GCCGCCACCG GCATGGCGCT GCGCTGGCTG AAGGCGCTGG AACAGGGCTT CGACGTCAAG
CTGACCGCCG GCACTCACGG CGGCTGCCTG CGCCTGATCG CCCAGGAAGG CGGCCCGCGC
AGTTTCGAGG AACTCAAGGG CAAGACCATC GGCGTCACCG ACATGGCCAG CCCGGACAAG
AACTTCTTCT CGCTGATGCT CAAGCGCCAC GGCGTCGACC CGGTCCGCGA CGTGACCTGG
CGGGTCTATC CGATCGACCT GCTCGGCACC GCCCTGGAGA AGGGCGAGGT CCAGGCGGCC
AGCGGCTCCG ACCCGATGAT GTACCGCCTG CGCAACCAGC CGGGCAAGCG CGAGCTGTCC
AACAACCTGG TCGAGGAGTA CGCCAACCTG AGCTGCTGCG TGGTCGGCGT CGGCGGCAAC
CTGGTGCGTA AGGAGCGGCC GGTCGCCGCC GCCGTCACCC ACGCCATCCT GCAGGCCCAC
GCCTGGGCGG CGCAGCACCC GGAAACCGTG GCCCAGGACT TCCTCAAGTT CGCGGTCAAC
ACCAATTCCG AGGAAATCAA CGCCATCCTC AACGAGCACA CCCACGCGCA CTACTCGGTG
GGCAAGGCCT TCGTCGACGA GATCGCCGTC TACGCCCGCG ACCTGAAGGC CGTGGAAGTG
CTGCGCGCCA GCACCGATCC CCGGAAATTC GCGGAGAGCA TCCATGCCGA CGTATTCGGT
TGA
 
Protein sequence
MTFDSNLSRR RLLGLAGTAA AAAAVGRFAW AADPHAHAAH AAHGAHGDDA QNFLRDAKSW 
ALPAPRKLKL ATNLNAICLA PVAVADSQGF FRNHNLEVEF VNFGNSTEVL LESLATGKAD
AATGMALRWL KALEQGFDVK LTAGTHGGCL RLIAQEGGPR SFEELKGKTI GVTDMASPDK
NFFSLMLKRH GVDPVRDVTW RVYPIDLLGT ALEKGEVQAA SGSDPMMYRL RNQPGKRELS
NNLVEEYANL SCCVVGVGGN LVRKERPVAA AVTHAILQAH AWAAQHPETV AQDFLKFAVN
TNSEEINAIL NEHTHAHYSV GKAFVDEIAV YARDLKAVEV LRASTDPRKF AESIHADVFG
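
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the codon assignments of NCBI translation table 11 (the table named in the record), written here as the usual 64-letter amino acid string in T, C, A, G base order. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 60 bases of the gene are used as input.

```python
# Amino acids for all 64 codons in T, C, A, G order (NCBI table 11 assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First line of the gene sequence as shown in the record.
first_line = "ATGACCTTCG ACTCCAACCT CTCCCGCCGT CGCCTGCTCG GCCTGGCCGG CACCGCCGCC"
print(translate(first_line))  # prints "MTFDSNLSRRRLLGLAGTAA"
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content of 68%.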