Gene Avin_35420 details

Gene Information

Locus tag: Avin_35420
Symbol:
ID: 7762437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3615615
End bp: 3616970
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 62%
IMG OID: 643806411
Product: Sodium/sulphate symporter-like protein
Protein accession: YP_002800666
Protein GI: 226945593
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID: [TIGR00785] anion transporter
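
The coordinates and lengths in this record are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with exactly one stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(3615615, 3616970)
print(gene_len, protein_len)  # 1356 451
```

Both values match the Gene Length and Protein Length entries for this gene.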


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGCCA TACTGGCTTT CGCCGTGATC ATCTGGATTT CGGAGGCGAT GGATTACACC 
GTCAGCTCCA TCCTGATCGC AGCCCTGATC ATTTTCCTGG TGGGCAGCGC CCCCTCGGTG
GCCAATCCCG ATGTGCCATA CGGTACCCAG GGGGGACTCA AACTGGCGCT GGCGGGTTTT
TCCAATTCGG GCCTGGCGCT CGTCGCCGCC GCTTTGTTCA TCGCTGCCGC CATGACGGTG
ACGGGGCTCG ACCAGCGTAT TGCGCTATTC ACCATGTCCC ATATCGGGGC GGGTGGCAGA
AGGGTGCTGA TCGTTTCCAT CATCGTCACT ATCCTGCTCA GCTTCGTGGT ACCCAGTGCC
ACCGCCCGCA CGGCCTGCGT GGTCCCCATC ATGCTGGGCG GTATCGCTTC TCTGAAACTG
GATCGCAAGG GGCCGCTCGC CGCTTCGATC ATGATCACCA TCGCCCAGGC GACCAGCATC
TGGAACATCG GCATCATGAC CTCCGCCGCC CAGAACCTGT TGTCGCGGGG GTTCGTGGAA
AAGCAGTTCG GCGCCGACCA GGCGCTGGCC TGGGTCGATT GGCTGATCGC CGGCGCGCCC
TGGGCGGTCA TCATGTCGGT CATCCTTTAT TTCGTGGTGC GCAAGGTGTT GCCGCCAGAA
ATCGAGGAAA TCCCCGGTGG CAAGCAGGCC ATGGCCGAAT CCTATAGGGC GCTGGGTTCC
ATGACGGCTC CCGAAAAGCG CCTGCTGGCG ATCTCGCTCG TTCTCCTCGG ACTCTGGGCG
ACGGAGAACA AGCTTCATTC GCTCGACACC GCCTCTACCA CCATCGCCGG TGTCGCTCTG
ATGCTGCTTC CCGGTATCGG CGTGATGACC TGGAAACAGG CGCAGAAGCT GATTCCCTGG
GGAACGGTGA TCGTCTTCGC CGTTGGCATC AGCCTGGGCA CCGCGCTGCT CGATACCAAG
GCGGCACAGT GGCTTTCCAG TTCCGTGGTA CAGGCTTTCC AGCTTGATAC GCTGTCGCCC
TTCGCCATTT TCGCCCTGCT GTCGGCTTTC CTGATTCTGA TCCACCTGGG CTTTGCCAGT
GCCACCGCGT TGACCGCCGC CCTGATGCCG ATCCTGATCA ATGTGCTGGC CAGCGTGCCG
GATCTGAACG CACCGGGCAT CGCCATGCTG TTGGCCTTCA CTGTCAGCTT CGGTTTCATC
CTGCCCGTGA ACGCACCGCA GAACATGGTC TGTCTGGGTA CGGAAACCTT CACTACGCGA
CAGTTCACCA TGGTTGGCCT CTGGCTCACG GTGGCCGGCT ATGCGCTGCT GCTCCTCTTC
GCCCTCACCT GGTGGCCCCT GCTGGGGCTG ATCTGA
 
Protein sequence
MLAILAFAVI IWISEAMDYT VSSILIAALI IFLVGSAPSV ANPDVPYGTQ GGLKLALAGF 
SNSGLALVAA ALFIAAAMTV TGLDQRIALF TMSHIGAGGR RVLIVSIIVT ILLSFVVPSA
TARTACVVPI MLGGIASLKL DRKGPLAASI MITIAQATSI WNIGIMTSAA QNLLSRGFVE
KQFGADQALA WVDWLIAGAP WAVIMSVILY FVVRKVLPPE IEEIPGGKQA MAESYRALGS
MTAPEKRLLA ISLVLLGLWA TENKLHSLDT ASTTIAGVAL MLLPGIGVMT WKQAQKLIPW
GTVIVFAVGI SLGTALLDTK AAQWLSSSVV QAFQLDTLSP FAIFALLSAF LILIHLGFAS
ATALTAALMP ILINVLASVP DLNAPGIAML LAFTVSFGFI LPVNAPQNMV CLGTETFTTR
QFTMVGLWLT VAGYALLLLF ALTWWPLLGL I
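
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below joins the space-separated 10-base blocks and translates them codon by codon. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG). The names join_blocks and translate are hypothetical helpers, not part of any existing library. Rows are translated separately only because each full row holds 60 bases, a multiple of 3, so the reading frame is preserved.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = join_blocks(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_row = "ATGCTGGCCA TACTGGCTTT CGCCGTGATC ATCTGGATTT CGGAGGCGAT GGATTACACC"
last_row = "GCCCTCACCT GGTGGCCCCT GCTGGGGCTG ATCTGA"

print(translate(first_row))  # MLAILAFAVIIWISEAMDYT
print(translate(last_row))   # ALTWWPLLGLI*
```

The two outputs correspond to the first 20 and the last 11 residues of the protein sequence listed above, followed by the TGA stop codon.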