Gene Avin_31720 details

Gene Information

Locus tag: Avin_31720
Symbol:
ID: 7762072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3280793
End bp: 3281791
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 63%
IMG OID: 643806046
Product: Sulfate-binding precursor protein
Protein accession: YP_002800310
Protein GI: 226945237
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
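
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3280793
end_bp = 3281791

# Assumption: both endpoints are included in the gene span.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 999
print(protein_length)  # 332
```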


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCATCC GTCGTTTCGC CCTCGCCGCC CTGGCCGGCC TGAGCCTGAG CGCTGCCGCC 
CAGGCGCAGA CCCTGCTGCT CAACGTGTCC TACGACCCGA CCCGCGAGTT GTACCGGGAA
TACAACGCCG CCTTCAACAA GCACTGGCAG GCCGAGGGCC ATGAGCCCGT GACCATCCAG
CAGTCCCATG GCGGCTCGGG CAAGCAGGCG CGCGCGGTGA TCGACGGACT CAAGGCCGAC
GTGGTGACCC TGGCCCTGGC CGGCGATATA GATGAATTGC ACAAGCTCGG CAAGCTGATT
CCCGAGGACT GGCAGAGCCG CCTGCCGCAG GCCAGCACAC CCTACACCTC GACCATCGTA
TTCCTGGTGC GCAAGGGCAA TCCGAAAGGC ATCAAGGACT GGGGCGACCT GGTCAAGCCG
GGCGTGGAAG TGATCACCCC GAATCCGAAG ACCTCCGGCG GCGCACGCTG GAACTTCCTC
GCCGCCTGGG CCTGGGCACA GCAGCAGTAC GGTAGCGAGG ACAAGGCCCG CGCCTACGTC
GAACAGCTCT TCAAGCAGGT TCCGGTGCTG GATACCGGAG CGCGCGGCTC GACCATCACC
TTCGTCAACA ATAAAATCGG CGACGTCCTG CTGGCCTGGG AAAACGAGGC CTTCCTGGCC
CTGAAGGAAC AGGGTGGGGA AAACCTCGAG ATCGTCGTGC CTTCGCTGTC GATCCTCGCC
GAACCGCCGG TGGCGGTGGT GGACAAGAAC GTCGACCGCA AGGGTACCCG CGAACTGGCC
ACCGCCTACC TGAACTATCT GTACAGCGAG GAAGGCCAGC GCATCGCCGC GAAGAATTTC
TACCGTCCGC GCAACGAGAA GGTCGCCACC GAATTCGCCA AGCAGTTCCC CAACCTCAAG
CTGGTGACCA TCGACAAGGA TTTCGGTGGC TGGAAAACCG CCCAGCCGAA GTTCTTCAAC
GATGGCGGGG TGTTCGATCA GATCTACAAG GCGCACTGA
 
Protein sequence
MSIRRFALAA LAGLSLSAAA QAQTLLLNVS YDPTRELYRE YNAAFNKHWQ AEGHEPVTIQ 
QSHGGSGKQA RAVIDGLKAD VVTLALAGDI DELHKLGKLI PEDWQSRLPQ ASTPYTSTIV
FLVRKGNPKG IKDWGDLVKP GVEVITPNPK TSGGARWNFL AAWAWAQQQY GSEDKARAYV
EQLFKQVPVL DTGARGSTIT FVNNKIGDVL LAWENEAFLA LKEQGGENLE IVVPSLSILA
EPPVAVVDKN VDRKGTRELA TAYLNYLYSE EGQRIAAKNF YRPRNEKVAT EFAKQFPNLK
LVTIDKDFGG WKTAQPKFFN DGGVFDQIYK AH
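
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the sense-codon assignments of translation table 11 match the standard genetic code, and it uses the compact NCBI-style layout (bases ordered T, C, A, G). The helper name `translate` and the two sample blocks (the first and last lines of the gene sequence) are chosen here for illustration. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here starts with ATG, so none is needed.

```python
# Codon table in the compact NCBI layout: first, second, and third base
# each cycle through T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence; both begin on a codon boundary.
first_block = "ATGTCCATCC GTCGTTTCGC CCTCGCCGCC CTGGCCGGCC TGAGCCTGAG CGCTGCCGCC"
last_block = "GATGGCGGGG TGTTCGATCA GATCTACAAG GCGCACTGA"

print(translate(first_block))  # MSIRRFALAALAGLSLSAAA
print(translate(last_block))   # DGGVFDQIYKAH
```

The two outputs correspond to the first 20 and the last 12 residues of the protein sequence listed above.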