Gene Avin_31360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31360 
Symbol 
ID7762035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3242568 
End bp3243539 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content70% 
IMG OID643806010 
ProductABC transporter periplasmic aliphatic sulfonate-binding protein 
Protein accessionYP_002800274 
Protein GI226945201 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.694075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGAA AAGGCTTCAA GCGCCTGCTC GGCGCCGCGC TGCTCGGCGG TTCCCTGCTC 
GCCGGCGTCC AGGCGTTCGC CGCCAGCCTG GTGGTCGGCG ACCAGAGTTT CAATGCGCGC
ACGGTCATGG AGGCCGCCGG GGTGCTCGCC GACCTGCCCT ACGAACTGGA ATGGAAGCAG
TTCACCGCCG GCTCGCCGGT GGCCGAGGCG CTCAACGTGG GCAGCCTGGA CGTGGGCCTG
CTGGGCGACG CCCCGCCGCT GTTCCTCGGC GCGCTGGGAG CGCCGATCAA GGTGATCGGG
ATCAGCCAGC AGAACCGCGA GGGGGTCGCA ATCCTGGTGC GCAAGGACTC GCCCATCCGT
CGCCTGGAAG ACATCCGCGG CCATAGCGCG GCGATCTGGA AAGGCTCCTG GAGCCAGCAA
TTGCTGTTCA CCGCCCTGGA ACGGGCCGGG GTTTCGCCCG AACAGGTGCA ACTGCGCTAC
CTCGGCGCGC TGGATGCCTC GCATGCCCTG GAAGGCGGCT CGGTGGACGT GATCGCCACC
TGGGAGCCCT ATGTCACCCA GCAGGAGCTC CAGGGCGCCC GCGTGCTGGC CACCGCCGAG
GAGCTGATCC CGGCGCAGAC CTTCGTGGTC GCCACCGACA AGGCCATCGC CGCCAAGCGC
GAGCTGCTCG GCGACTTCCT GCGCCGCCTG CGCCAGGCTC GCGACTGGGT ACTGAGCGAC
CCGGCCAACA GCGAACGCTA CGCCGACACC TGGGCCGAAC TGACCCGCGC CGACCGCGAA
GTGGCGCGCC GCTGGTTCGC CCGCGCCGCC ATCCGGGTGC GCCCGGTGGA CGAGGCGGCC
ATCGCCGAGG CCCAGAAGAC CGTGGATTTC TTCAGCCGGA TCGGCCTGAT CAAGGGCTAT
CCGGCCGCCA GCCTGTTCGA CGCCTCTTTC AACGCTTATC TGCAGTCACC GGCCGCGCAG
GCGGCCCGGT AG
 
Protein sequence
MTGKGFKRLL GAALLGGSLL AGVQAFAASL VVGDQSFNAR TVMEAAGVLA DLPYELEWKQ 
FTAGSPVAEA LNVGSLDVGL LGDAPPLFLG ALGAPIKVIG ISQQNREGVA ILVRKDSPIR
RLEDIRGHSA AIWKGSWSQQ LLFTALERAG VSPEQVQLRY LGALDASHAL EGGSVDVIAT
WEPYVTQQEL QGARVLATAE ELIPAQTFVV ATDKAIAAKR ELLGDFLRRL RQARDWVLSD
PANSERYADT WAELTRADRE VARRWFARAA IRVRPVDEAA IAEAQKTVDF FSRIGLIKGY
PAASLFDASF NAYLQSPAAQ AAR