Gene Avin_24970 details


Gene Information

Locus tag: Avin_24970
Symbol:
ID: 7761411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2499734
End bp: 2501395
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 66%
IMG OID: 643805379
Product: Sulphate transporter-SulP-type
Protein accession: YP_002799656
Protein GI: 226944583
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00377] anti-anti-sigma factor
            [TIGR00815] high affinity sulphate transporter 1
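
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of this length, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2499734, 2501395)
print(length_bp)                           # 1662
print(expected_protein_length(length_bp))  # 553
```

Both values agree with the Gene Length and Protein Length fields of this record.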


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATGATCG CAATACGTGA GGCATGGCAG GCCGGTCTGC TGCGCCGGGA GCACTGGCTG 
CGCAACCTGG TCGCCGGCGT CATCGTCGGT GTCGTGGCGC TGCCGCTGGC CATGGCCTTC
GGCATCGCCT CCGGGGTGAA GCCGGAGCAG GGCATCTATA CCGCCATCGT CGGCGGCCTG
CTGGTTTCCC TGTTCGGCGG CAGCCGGCTG CAGATCGCCG GTCCGACCGG CGCCTTCATC
GTCATCCTGG CCGGGGTGAC CGCCGAGCAC GGGGTCGATG GCCTGCAGAT CGCGACTCTG
ATGGCCGGCG GCATTCTCCT GCTGCTCGGT GTCGGCCGCC TGGGGACGGT CATCAAGTAC
ATTCCGGACC CGGTCATCGT CGGTTTCACG GCCGGCATCG GGGTGATCAT CTGGGTCAGC
CAGTGGAAGG ATTTCTTCGG ACTGCCGGCA GTGGCGGGCA CGCATTTCCA CGAGAAATTC
TGGCACCTGC TGCAGTCGCT GCCGCAACTG CACCTGCCGA CCACGCTGCT GGCCCTGGCC
AGCCTGGCGC TGGTCATCTT CGCACCGCGC CTGCGAATGC TGCGGCGCGT TCCCGGTCCG
CTGATCGCGA TGGTCGCCGC CACCGTCATC CAGTCGTTCT TCCGGTTCGA CGGAGTGGCC
ACCATCGGCA GCGCTTTCGG CGGCATTCCC CAGGGATTGC CGCAGCTCGG GCTGCCGGAG
ATCACTCCGT CGCGGCTGAT CGAACTGATC GGCCCGGCCT TCGCCATCGC CATGCTCGGC
GCCATCGAAT CGCTGCTCTC CGCCGTGGTC GCCGACGGCA TGGGCGGAAC CAAGCACGAC
TCCAACCAGG AATTGATCGG TCAGGGCATC GCCAACCTGG CCACGCCGCT GTTCGGCGGC
TTCGCCGCCA CCGGAGCGAT CGCCCGTACC GCGACCAACA TCCGCAACGG CGGCAACAGT
CCCCTGGCCG GCATCGTCCA TGCGCTCGTG CTGGTGCTGA TCCTGCTGTT CCTGGCGCCG
CTGGCGGCGA ACATTCCGCT GGCCGCGCTG GCCGCCATCC TCTTCGTGGT GGCCTGGAAC
ATGAGCGAGC TGAAGCACTT CAAGCGCATG CTGCGGCGGG CGCCCAGGGC GGACGTCGGC
ATTCTGCTGA TCACCTTCGG GTTGACGGTG TTCAGCGATC TGGTGATCGC GGTGAACATC
GGCGTGATCC TGGCCATGCT GCAGTTCATG CGCCGCATGG CCTCTTCGGT GGCCGTGCGG
CAGCAGCTCG AACGGGATCT GGAGCCGGAA CTGCTCGGCA ACGGGCATAG CCGGCTGCCC
GACGGGGTAC TGGTCTATAC CGTGGAGGGG CCGCTGTTCT TCGGCGCGGC GGAAACCTTC
GAGCGTGCGC TGGCCAGTAC CCATACCGAT CCGCGCCTGT TGATCATTCG CCTGAAGCGG
GTACCTTTCA TGGATATCAC CGGTCTACAG ACGCTGGAGG AAGTCATCCG GCAACTGGAA
AGACGCAGGA TCCGGGTCAA GCTCTGCGAA GCGAGCCCAC GGGTGCATGG CAAGCTGGAG
CGGGCGGGAA TACTGGAGTT GATCGGTGCA CGCGATTATC ACGCGAGCTT CGCCGAGGCC
CTGTCCGCCA GCGAGGAAAA GGCGGAGGTC ACGGCTTCCT GA
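
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of that clean-up together with a GC-fraction calculation of the kind that underlies the GC content field; the function names are hypothetical, and only the first 30 bases of the sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used for the 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three display blocks of the gene sequence shown above.
fragment = clean_sequence("ATGATGATCG CAATACGTGA GGCATGGCAG")
print(len(fragment), gc_fraction(fragment))
```

Passing the complete gene sequence through the same two functions gives the gene-wide GC fraction, which can then be compared with the reported GC content.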
 
Protein sequence
MMIAIREAWQ AGLLRREHWL RNLVAGVIVG VVALPLAMAF GIASGVKPEQ GIYTAIVGGL 
LVSLFGGSRL QIAGPTGAFI VILAGVTAEH GVDGLQIATL MAGGILLLLG VGRLGTVIKY
IPDPVIVGFT AGIGVIIWVS QWKDFFGLPA VAGTHFHEKF WHLLQSLPQL HLPTTLLALA
SLALVIFAPR LRMLRRVPGP LIAMVAATVI QSFFRFDGVA TIGSAFGGIP QGLPQLGLPE
ITPSRLIELI GPAFAIAMLG AIESLLSAVV ADGMGGTKHD SNQELIGQGI ANLATPLFGG
FAATGAIART ATNIRNGGNS PLAGIVHALV LVLILLFLAP LAANIPLAAL AAILFVVAWN
MSELKHFKRM LRRAPRADVG ILLITFGLTV FSDLVIAVNI GVILAMLQFM RRMASSVAVR
QQLERDLEPE LLGNGHSRLP DGVLVYTVEG PLFFGAAETF ERALASTHTD PRLLIIRLKR
VPFMDITGLQ TLEEVIRQLE RRRIRVKLCE ASPRVHGKLE RAGILELIGA RDYHASFAEA
LSASEEKAEV TAS
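
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of such a translation follows, under stated assumptions: the names are hypothetical, the codon-to-residue assignments are those shared by the standard code and table 11, and the special handling of alternative start codons is ignored because this gene begins with ATG.

```python
BASES = "TCAG"
# Residues for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate_cds(blocked: str) -> str:
    """Translate a blocked DNA sequence, dropping one terminal stop codon."""
    seq = "".join(blocked.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 and last 12 bases of the gene sequence shown above.
print(translate_cds("ATGATGATCG CAATACGTGA GGCATGGCAG"))  # MMIAIREAWQ
print(translate_cds("ACGGCTTCCT GA"))                     # TAS
```

The two fragments reproduce the first ten and the last three residues of the protein sequence listed above.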