Gene Avin_08230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_08230 
Symbol 
ID7759777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp780269 
End bp782041 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content66% 
IMG OID643803741 
ProductArylsulfotransferase 
Protein accessionYP_002798043 
Protein GI226942970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCATT TTGAACAACA TAGCTCCCCA TCGCCTTCCG CCCATCCGGA GCGCGGGGCA 
CAGCCCGAGC AGGCATTGCC CGAAGGCGCC TGTCTCACCG CCCGCATCCC CGACAGCGAC
AACGCCAGGC TGGGCGCCAT CGTCGTCAAC CCGTATCGCC TGGCTCCGTT GACCGCCGTC
ATCCGCGACG GGGGCCAGCG CATCTCCCAG GCGCGGGTGC GGGTCAAGGG ACGCGGCGAG
GGAGGGGTGG ACATCGATTA CGCGGTCGAG GATCGCACTC TCTGGACTCA CGGCGGCATC
CCGGTGTTCG GTCTCTATCC GGACCATCGC AACGAAGTCG AGGTGGCCTA CAGCCTGGAC
GGCGAACGCA TTCGCGAGCG TTACTTCATC TATGCTCCGG CGGTGCGTCT GCCGGTGGTG
GCCAGTCAGG AAAGCGCGCT GCCACGGGTC GAGCCGCTGA AGGTGGCGCC GCACCTGAAA
AAGCGCCTGT ACCTGTTCAA CCACCTGCTC ACCGAGATTC CCGGTAGCAA CCGCCTGCTG
AAGTGGAACG CTCCGGGCGG CGCCGCCGAG TGGGACTCGG TGGGGATCAA TTGGATCGCC
GACAGCAACG GCGACGTGCG CTGGTACCTG GACATCGAGC AGATCCACGA CTCCACCCGC
AAGGATGGGC TGGGGGCGAG CATGGGCTTC CAGCAGACCC GCGACGGCCA CCTGATCTGG
GGCCAGGGCC AGCGCTACTA CAAGCACGAC CTGCTGGGCC GCACGATCTG GGCGCGCACC
TTGCCGGAGA AGTTCGCCGA CTTCTCCCAT GAGATCCGCG AGACCGAAAA AGGTACCTAC
CTGCTGCGGG TCGGTACCAG CGACTATCGT CGCGCCGACG GAAAGCGGGT GCGCTCGATC
CGCGACCACA TCCTGGAAGT CGACCAGAAC GGCGATGTGG TGGACTTCTG GGACCTGAAC
CGAATCCTCG ACCCCTATCG CGCCGAACTG CTGCACACCC TGGGACGGGC CGCGGTGTTG
CTGCCCGAGG GCGTCGCCAG GAGTGACGAT CTGCTGGACA ACGAACGTAA CGAAGGCGAT
GTCCTGCCGT TCGGTGACAC ACCCGGTGTC GGCACCGGGC GCAACTGGGC CCACGTCAAC
GCCATCGAGC ATGACCCGGC CGACGACAGC ATCATCGTCT CCGCCCGTCA CCAGGGGGTG
GCCAAGATCG GGCGCGACAA ACAGGTGAAA TGGCTGCTGG CCGATCCGCG AGGATGGAGC
AGCGCCCTGC GGGCCAAGGT GCTCACGCCG GTAAACGCCG CTGGCGAGGT ATTGGCGCAG
AATCCCAACG GCAGTTATCC CGAGGGCTTC GACTGGTCCT GGACCCAGCA CACCGCCTGG
CTCAGCAGCA AGGGAACGCT CACCGTGTTC GATAACGGCT GGGGTCGCAA CCTGGCGCCC
ACTCGCCTGG AAGGCAACTA CAGCCGGGCG GTGGAATACC GCATAGACGA GGAGAAGGGC
ACCGTGCAGC AACTCTGGGA GTTCGGCAAG GAACGCGGCG ACGCCTGGTA CAGCCCGATC
ACCTCGGTGG TGGAGTACCG CCCGGAATCC GACACCCTGC TGATCTACTC GGCGGCGATC
GGGCATCTGA CGCCACAGCG GCTGACCCGG CCGGTGCTCA GCGAAGTGAA GTACGGCACC
CAGGAAGTGC TCAGCGAATT CCGCGTGGTC AGCGGCCAGC CGGGCAACGT CGGTTACCGG
GCGCTGGTGA TCGATCTGGA GCGGCTGTTC TGA
 
Protein sequence
MSHFEQHSSP SPSAHPERGA QPEQALPEGA CLTARIPDSD NARLGAIVVN PYRLAPLTAV 
IRDGGQRISQ ARVRVKGRGE GGVDIDYAVE DRTLWTHGGI PVFGLYPDHR NEVEVAYSLD
GERIRERYFI YAPAVRLPVV ASQESALPRV EPLKVAPHLK KRLYLFNHLL TEIPGSNRLL
KWNAPGGAAE WDSVGINWIA DSNGDVRWYL DIEQIHDSTR KDGLGASMGF QQTRDGHLIW
GQGQRYYKHD LLGRTIWART LPEKFADFSH EIRETEKGTY LLRVGTSDYR RADGKRVRSI
RDHILEVDQN GDVVDFWDLN RILDPYRAEL LHTLGRAAVL LPEGVARSDD LLDNERNEGD
VLPFGDTPGV GTGRNWAHVN AIEHDPADDS IIVSARHQGV AKIGRDKQVK WLLADPRGWS
SALRAKVLTP VNAAGEVLAQ NPNGSYPEGF DWSWTQHTAW LSSKGTLTVF DNGWGRNLAP
TRLEGNYSRA VEYRIDEEKG TVQQLWEFGK ERGDAWYSPI TSVVEYRPES DTLLIYSAAI
GHLTPQRLTR PVLSEVKYGT QEVLSEFRVV SGQPGNVGYR ALVIDLERLF