Gene Avin_21910 details


Gene Information

Locus tag: Avin_21910
Symbol: asfC
ID: 7761109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2187847
End bp: 2188836
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 68%
IMG OID: 643805076
Product: ABC transporter, substrate-binding protein, aliphatic sulphonate
Protein accession: YP_002799357
Protein GI: 226944284
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
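
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2187847, 2188836)
print(length, protein_length(length))  # prints: 990 329
```

Under these assumptions, the result agrees with the 990 bp and 329 aa values in the record.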


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.58658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACG CTAGCCGTCT CTGTCTCACC CTGGCCGGGC TGCTGTCCTG CTCGGGAATC 
GCCTGGGCGC AGAACCTGCA AGCCCTGCGG GTGGCCAATC AGAAATCCGG CATCAAGCTG
CTCCTGGAGG CGGCCGGGGA ACTGCAAGAG GTGCCCTACG CCATCCAGTT CTCCGAATTT
CCGGCGGCCG CGCCGCTGGG CGAGGCGCTG AACGCCGGCG CGGTGGATGT CGGCGGCCTG
GGCGACGCGC CCTACGTTTT CGCCCTGGGC AGCGGTGCAG CGCTGAAGGT CGTCTGCATC
GTCCATGCGG CCGGCCGCCT GAGCACGGCG ATCATCGTGC CCAAGGACTC GCCCCTGCAC
GGCGTCGCCG ACCTGAAGGG TAAGCGCATC GTCACCGGAC GCGGCTCGAT CGGGCATTTC
CTGGCGCTCA AGGCCCTGCG CGAGGCGGGA CTGCAAAGCA GCGACGTACG CTTCGTCAAC
CTGCTGCCCA GCGACGCGCG CAGCGTCCTG GAGAGCGGCG GCGCCGACGC CTGGTCGACC
TGGGACCCGT ACACCGCCAT CGCCATCACC CAGGGCGCCC GGGTGCTGGT CAACGGCAGC
CACCTGCTCA GCAACAACTT CTATCTGGCG GCGACCGCCC AGGCCATCGA GGACAAACGC
CCGCAACTCA CGGACTTCGT GAAGCGGCTG GAGCGCGCCT ATCGATGGGC CAACCAGCAT
CCGGACGCCT ACGCCGCCGC CCAGTCCAGG GTCACCGGCC TGTCCCGCGA GACGCACCTG
GAGTCGGCCA GGAATACCCG TTTCCAGCGG GTCCCGATCG ACGATGCGCT GATCGAGGGT
CTGCAGGCGA CCGCCGACCT GTATTTCGAG GAAGGCATCA CCGGCAAGCG AATCGAGGTT
TCGCAGGGCT TCGACAGGAG TTTCAACGAG GCGGCCGACG GACCGCTTCT ACCCGCCCCG
TCCCGGGTCC AGGCCTCCGG CCGCCCATGA
 
Protein sequence
MKNASRLCLT LAGLLSCSGI AWAQNLQALR VANQKSGIKL LLEAAGELQE VPYAIQFSEF 
PAAAPLGEAL NAGAVDVGGL GDAPYVFALG SGAALKVVCI VHAAGRLSTA IIVPKDSPLH
GVADLKGKRI VTGRGSIGHF LALKALREAG LQSSDVRFVN LLPSDARSVL ESGGADAWST
WDPYTAIAIT QGARVLVNGS HLLSNNFYLA ATAQAIEDKR PQLTDFVKRL ERAYRWANQH
PDAYAAAQSR VTGLSRETHL ESARNTRFQR VPIDDALIEG LQATADLYFE EGITGKRIEV
SQGFDRSFNE AADGPLLPAP SRVQASGRP
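
The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch. It is a minimal example, not the method used to produce this record: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two differ only in permitted start codons), translates the first 60 bases of the gene sequence, and includes a helper for GC percentage. The names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG order (amino acids as in NCBI table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGAAAAACG CTAGCCGTCT CTGTCTCACC CTGGCCGGGC TGCTGTCCTG CTCGGGAATC"
print(translate(first_line))  # prints: MKNASRLCLTLAGLLSCSGI
```

Passing the full 990-base gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be compared in the same way.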