Gene Avin_01050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_01050 
Symbol 
ID7759072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp106074 
End bp107645 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content70% 
IMG OID643803031 
Productsulfate transporter 
Protein accessionYP_002797347 
Protein GI226942274 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCGACT CGCACACGGA AGAAACCTCC ATGAACCTCG AAACCCTGAA ATCCACCCTG 
CCACGGGACG CCATGGCCTC CGTGGTGGTC TTTCTCGTCG CCCTGCCACT GTGCATGGGC
GTCGCCATCG CCTCCGGCAT GCCGCCGGCC AAGGGTCTGA TCACCGGCAT CATCGGCGGG
CTGGTGGTCG GCTGGCTGGC CGGCGCGCCG CTGCAGGTCA GCGGCCCGGC GGCGGGCCTC
GCCGTGCTGG TCTTCGAACT GGTGCGCCAG CACGGCATCG CCATGCTCGG GCCGATCCTG
CTGCTCGCCG GCCTGATCCA ACTGCTCGCC GGACGCCTGC GCCTGGGCTG CTGGTTCCGC
GTCACCGCGC CGGCGGTGGT CTACGGCATG CTGGCGGGGA TCGGCATCCT GATCATCCTG
TCGCAGTTAC ACGTGATGCT CGACGCTTCG CCGCAGGCTT CCGGCCTGAC CAACCTGCAG
GCCTTCCCGG CGGCGCTGCT CGACGCCCTG CCGTTCGGCG AAGGCAGCGG CTGGAGCGCC
GGGCTGCTCG GTCTCCTGAC CATCGGCACC ATGTGGCTGT GGGACAAGCG CAAGCCGGCC
AGACTGCGTT TCCTGCCCGG CGCGCTGCTC GGCGTGAGCC TGGCGACCCT GGTCAGCCTG
GTCTTCTCGA TGGACGTGCA CCGGGTCGAG GTGCCGGCCA ACCTGGGCGA TGCCATCGAC
TGGGTACAGC CGGCCGACCT GTTGCGCCTG CTGACCGAGC CCAGCCTGCT GATCGCGGCG
CTGACCCTGG CCTTCATCGC CAGCGCCGAG ACCCTGCTGT CGGCCGCCGC GGTGGACCGC
ATGCACCAGG GCCCGCGCGC CGACTTCGAT CGCGAGCTGA CTTCCCAAGG CATCGGCAAC
ATGCTCTGCG GCCTGCTCGG CGCCCTGCCG ATGACCGGGG TGATCGTGCG CAGCTCGGCC
AACGTGCAGG CCGGCGCCGT CACCCGCCTG TCCGCCGTCC TCCACGGCCT CTGGTTGCTG
GCTTTCGTCC TGCTGCTGAC CGCGGTGCTG CAGAGCATTC CGGTGGCCAG CCTGGCCGGG
GTGCTGGTCT ACACCGGGGT CAAGCTGGTC GACCTCAAGG CCCTGCGCGG CCTGGGCCGC
TACGGCCGCA TGCCGATGTT CGTCTACGCC GCCACGGCCC TGGCGATCGT CTGCACCGAC
CTGCTGACCG GGGTGATGAT CGGCTTCGCC CTGACCCTGG CCAAGCTCGC CTGGCGCGCC
TCGCGGCTGA AGATCAGCCT GGTCTACGAC GAGGACGGAC GCACCGCGGA GCTGCGCATG
GTCGGCTCGG CGACCTTCCT CAAGGTGCCT GAACTGTCCC GCGTGCTGGC CACGGTGCGT
CCCGGTACCC AGTTGCACGT ACCGCTGGAT CACCTGAACT ACATCGACCA CGCCTGCATG
GAGCTGCTCG ACGAGTGGAG CGCCGCCAGC GCGGCCAACG GCTCCGAGCT GGTGCTCGAG
CCCCGCGCGG TGAAACGCCG CCTGGAAGGC CGGCTGCGCA CCACGGTGGG CATCGGCGGC
GCGACCGCCT GA
 
Protein sequence
MFDSHTEETS MNLETLKSTL PRDAMASVVV FLVALPLCMG VAIASGMPPA KGLITGIIGG 
LVVGWLAGAP LQVSGPAAGL AVLVFELVRQ HGIAMLGPIL LLAGLIQLLA GRLRLGCWFR
VTAPAVVYGM LAGIGILIIL SQLHVMLDAS PQASGLTNLQ AFPAALLDAL PFGEGSGWSA
GLLGLLTIGT MWLWDKRKPA RLRFLPGALL GVSLATLVSL VFSMDVHRVE VPANLGDAID
WVQPADLLRL LTEPSLLIAA LTLAFIASAE TLLSAAAVDR MHQGPRADFD RELTSQGIGN
MLCGLLGALP MTGVIVRSSA NVQAGAVTRL SAVLHGLWLL AFVLLLTAVL QSIPVASLAG
VLVYTGVKLV DLKALRGLGR YGRMPMFVYA ATALAIVCTD LLTGVMIGFA LTLAKLAWRA
SRLKISLVYD EDGRTAELRM VGSATFLKVP ELSRVLATVR PGTQLHVPLD HLNYIDHACM
ELLDEWSAAS AANGSELVLE PRAVKRRLEG RLRTTVGIGG ATA