Gene Avin_20980 details


Gene Information

Locus tag: Avin_20980
Symbol: cysE3
ID: 7761023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2093069
End bp: 2094052
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 67%
IMG OID: 643804993
Product: Serine O-acetyltransferase
Protein accession: YP_002799274
Protein GI: 226944201
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1045] Serine acetyltransferase
TIGRFAM ID: [TIGR01172] serine O-acetyltransferase
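
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2093069, 2094052)
print(length)                     # 984
print(protein_length_aa(length))  # 327
```

Under these assumptions both results agree with the Gene Length and Protein Length values listed in the record.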


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.101703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAACA AACCCAAGGC CGTCTTCGAC GGCGTCTCGC ATACGCCGGA GGTACCCGCG 
CCCCGGCCGG TGAACTGGCA ACTGGAGCAG ATCGTCGATG AACTGCGTGC CGCTCGCGCC
GATTGGCGCA AGAACTGCGG CCGGACGCGC GAGCTGGGCA GCCGCGAATT GCCCTCGCGG
CAGACCGTGG CGGAGATTCT CGCCGCGCTG AGCGGCGCGC TCTTTCCGAT GCGCCTCGGG
CCCAGCGATC TGCGCGAAGA GAGCGAGGAT TTCTATGTCG GGCACACCCT CGACAGCGCA
CTGAACGCGC TGCTCGGCCA GGTACGCCTG GAACTGCACT ATGTCGCCCG CCAGTGCGGG
CAGCGCGAGC CGGATCTGGA GACGCGCGCG GTGCAGATCG TCCGCGAATT CGGCGCCGCC
TTGCCGGAAA TGCGCCGGCT GCTGGACAGC GACGTGATCG CCGCCTACCA GGGCGATCCG
GCGGCGCGCA GCCTGGACGA GGTACTGATC TGCTATCCGG GCGTCCAGGC GGTGATCCAC
CATCGCCTGG CCCATCTTCT GTACCGCTCC GGGGTGCCGC TGCTGGCGCG GATCGTCGCG
GAGATCGCTC ATTCCGCCAC CGGCATCGAC ATCCATCCGG GAGCGCAGAT CGGTCACAGC
TTCTTCATCG ACCACGGGAG CGGCGTGGTG ATCGGCGAAA CCGCGGTGAT CGGCAACCGC
GTACGCATCT ATCAGGCGGT GACCCTGGGC GCCAAGCGCT TCACCGTCGA CGAGTCCGGC
CAGTTGCTCA AGGGCCAGGC CCGTCATCCT ATCGTCGAAG ACGACGTGGT AATTTATGCC
GGCGCCACCA TTCTGGGCCG CATCACCATC GGCAAGGGTT CCATCATCGG TGGCAATGTC
TGGCTGACCC GTAGCGTGCC TCCGGGCAGC AACGTCACCC AGGCGACCTT GCAACATCAG
CCAGGCAACG CGGGGCAGCC GTGA
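
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the GC content is taken here as the share of G and C bases over all bases, which may not be exactly how the 67% figure in the record was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The final line of the gene sequence, pasted exactly as displayed.
last_line = "CCAGGCAACG CGGGGCAGCC GTGA"
print(len(clean_sequence(last_line)))  # 24
print(gc_percent(last_line))           # 75.0
```

Passing the full sequence, line breaks included, to the same functions gives the whole-gene length and GC percentage for comparison with the Gene Length and GC content fields.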
 
Protein sequence
MSNKPKAVFD GVSHTPEVPA PRPVNWQLEQ IVDELRAARA DWRKNCGRTR ELGSRELPSR 
QTVAEILAAL SGALFPMRLG PSDLREESED FYVGHTLDSA LNALLGQVRL ELHYVARQCG
QREPDLETRA VQIVREFGAA LPEMRRLLDS DVIAAYQGDP AARSLDEVLI CYPGVQAVIH
HRLAHLLYRS GVPLLARIVA EIAHSATGID IHPGAQIGHS FFIDHGSGVV IGETAVIGNR
VRIYQAVTLG AKRFTVDESG QLLKGQARHP IVEDDVVIYA GATILGRITI GKGSIIGGNV
WLTRSVPPGS NVTQATLQHQ PGNAGQP
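
The protein sequence can be compared against a translation of the gene sequence. The sketch below builds a codon table from the usual compact TCAG ordering and translates until the first stop codon. It relies on translation table 11 assigning the same amino acids to codons as the standard code; table 11 also allows alternative start codons, which this sketch does not handle, and that does not matter here because the gene begins with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; trailing bases are ignored.
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two displayed blocks of the gene sequence.
print(translate("ATGTCCAACA AACCCAAGGC"))  # MSNKPK
```

The output matches the first six residues of the protein sequence; translating the complete gene sequence the same way allows a full comparison.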