Gene Avin_30540 details


Gene Information

Locus tag: Avin_30540
Symbol: cysP
ID: 7761954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3163324
End bp: 3164322
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 65%
IMG OID: 643805930
Product: Sulfate ABC transporter-binding component-CysP-like protein
Protein accession: YP_002800194
Protein GI: 226945121
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
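
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3163324
end_bp = 3164322

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")    # 999 bp
print(protein_length, "aa")  # 332 aa
```

Under those assumptions the 999 bp gene holds 333 codons, the last of which is the stop codon, giving the 332 aa protein length listed in the record.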


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGAT TGCTGACCTC TTCCCTGCTG GCGGCCGGCG TGGCCCTGGC CTCCGTCGCC 
AGCCAGGCGG CCCCGTTACT GAACGTCTCC TACGACGTGA TGCGCGACTT CTACAAGGAG
TACAACCCGG CCTTCCAGAA ACACTGGCAG GCCGAGGGCA ACCCACCGGT ACAGATCCAG
ATGTCCCACG GCGGCTCCAG CAAGCAGGCG CGCGCGGTGA TCGACGGCCT GCCGGCCGAC
GTCATCACCA TGAACATGGC CACCGACATC AATGCCCTGT ACGACCACGG CAAGCTGATT
CCGCAGAACT GGGCCGAGCG CCTGCCGGAC AACAGCGCCC CCTTCACCTC GGCGACCGTG
TTCATCGTCC GCAAAGGCAA CCCGAAACAG CTCAAGGACT GGCCCGACCT GCTCAAGGAG
GGCGTGCAGG TGGTGGTGCC CAATCCCAAG ACCTCGGGCA ACGGCCGCTA CACCTACCTG
TCGGCCTGGA GCTACGCACT GAAGAACGGT GGCGACGACA AGGCCGCGCG CGACTTCGTC
GGCAAGCTGT TCAAGCAGGC GCCGGTGCTC GACACCGGCG GTCGCGCCGC TACCACTACC
TTCATGCAGA ACCAGATCGG CGACGTGCTG GTGACCTTCG AGAACGAGGC GGAAATGATC
GCCCGCGAAT TCGGCCGCGG CGGCTTCGAG GTGGTCTATC CCAGCATCTC CGCCCAGGCC
GAACCGCCGG TGGCGGTGGT CGACAAGGTG GTCGACAAGA AGGGCACCCG CAAGGAGGCC
GAGGCCTACC TGAAATACCT ATGGTCCGAC GAGGGCCAGC GCATCGCCGC CAACAACTAC
CTGCGCCCGC GCAATCCGAA GATCCTCGCC GAATTCTCCG ACCGCTTCCC CAAGGTCGAA
TTGCTCGACG TGGTGAAGAC CTTCGGCGAC TGGCCGACCA TCCAGAAGAC CCACTTCAAC
GACGGCGGCG TGTTCGACCA GGTCTACGGC GGACGCTGA
 
Protein sequence
MKRLLTSSLL AAGVALASVA SQAAPLLNVS YDVMRDFYKE YNPAFQKHWQ AEGNPPVQIQ 
MSHGGSSKQA RAVIDGLPAD VITMNMATDI NALYDHGKLI PQNWAERLPD NSAPFTSATV
FIVRKGNPKQ LKDWPDLLKE GVQVVVPNPK TSGNGRYTYL SAWSYALKNG GDDKAARDFV
GKLFKQAPVL DTGGRAATTT FMQNQIGDVL VTFENEAEMI AREFGRGGFE VVYPSISAQA
EPPVAVVDKV VDKKGTRKEA EAYLKYLWSD EGQRIAANNY LRPRNPKILA EFSDRFPKVE
LLDVVKTFGD WPTIQKTHFN DGGVFDQVYG GR
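
The gene and protein sequences above are related through translation table 11, and the GC content is a simple base count. The sketch below shows one way to reproduce both from the space-separated 10-base blocks used in this record. It uses only the Python standard library; the function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool. It assumes the sequence contains only A, C, G, and T, and it does not treat alternative start codons specially.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons ordered by
# bases T, C, A, G (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Return the fraction of bases that are G or C (non-empty input)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from the record above.
first_line = "ATGAAGCGAT TGCTGACCTC TTCCCTGCTG GCGGCCGGCG TGGCCCTGGC CTCCGTCGCC"
print(translate(first_line))  # MKRLLTSSLLAAGVALASVA
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way gives values that can be compared with the GC content and protein sequence listed in the record.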