Gene Avin_31350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31350 
Symbol 
ID7762034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3241379 
End bp3242557 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content72% 
IMG OID643806009 
ProductDszC-like desulfurization enzyme 
Protein accessionYP_002800273 
Protein GI226945200 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGCC CATCCACCCC CGAAGACTGG CTGGCGCTGG CCCATGGGAT CGGCGAGGAA 
TTCGCCCGCG ACATCGCCCG CCGCACCCGC ACCCGCGAAC GTCCCGACAC CCAGTTGCGG
CGCCTCAAGG ACAGCGGCCT GACCAATCTG GCGATCCCCC GCGAGCTGGG CGGCGCCGGC
CAGCGCTGGT CGCTGATCGT GCGCACCATC CGCGAACTGG CCGCCGGCGA CGGCTCGGTC
GGCATGCTCT ATGGCTACCA CCAGCTCAAC CTGGTCAACC TGCGCCGCGA GCCGCAGCCG
CGCCGCGACC GCCTTCTGGC GGAGATCGCC GAACGCCGGC TGTGGCTGGC CGGGGTGGTC
AATCCGCGCG ACGACGACAT CCTCGCCACG CCGGACGGCG AGGGTTTCCG CCTCAACGGC
CGCAAGGGCT TCTGCAGCGG CGCGGCCTTC GCCGATCTGC TCAGCGTCAG CGCGCGCCAT
GCCCACGACG GCCAGCGGCT GATGGCGCTG ATCCCTTCCG ACCGTCCCGG CCTGCACTAC
GCCGAGGATT GGGACCATTT CGGCGTGGAG CGCAGCGACA GCGGCAGCTT CGTCCTGAGC
GAGGTGCGCG CCGAACCCGG CGAAGTCATC GCCAACGACC TGGAGGACGG CAGCGATTTC
TCCGCGGTGA TCCGCACCCC GGTCAACCAG TCCGCCTTCA CCCAGTTCTA CCTCGGCAAC
GCCCTCGGCG CGCTGCGCGC GGCGCGCGCC TACGTGCACC GCGAAGGCCG CGCCTGGCTG
CACGCGGGGG TCGACGAGGC CCATCGGGAC CCGCTGCTGG TCAGCCAGTT CGGCGAGCTG
TGGATCGCCC TGCAGGGCGC CATCGCCCTG GCCGACCGCG CCGCGCTCAA GGTCGACGAG
TTGCTCGCCG CGGACGAGGC CTTCACCCCG GAACTGCGCG GCGAGGCCGC CGTCGAGGTG
GCCAGCGCCA AGGTGCTCGC CGCGCGCACC GCGCTCGATG TCACCAGCCG GGTGTTCGAG
GTGATGGGCG CGCGCGCCAC CCACAACCGC TACGCCTTCG ACCGCTTCTG GCGCGACACG
CGCACCCACA GCCTGCACGA CCCGCTCGCC CACAAGCTGC TGGAAGTCGG CGAATACGCC
CTGAACGGCC AGTACCCGCC GGTGCGGGCC TACACCTGA
 
Protein sequence
MGSPSTPEDW LALAHGIGEE FARDIARRTR TRERPDTQLR RLKDSGLTNL AIPRELGGAG 
QRWSLIVRTI RELAAGDGSV GMLYGYHQLN LVNLRREPQP RRDRLLAEIA ERRLWLAGVV
NPRDDDILAT PDGEGFRLNG RKGFCSGAAF ADLLSVSARH AHDGQRLMAL IPSDRPGLHY
AEDWDHFGVE RSDSGSFVLS EVRAEPGEVI ANDLEDGSDF SAVIRTPVNQ SAFTQFYLGN
ALGALRAARA YVHREGRAWL HAGVDEAHRD PLLVSQFGEL WIALQGAIAL ADRAALKVDE
LLAADEAFTP ELRGEAAVEV ASAKVLAART ALDVTSRVFE VMGARATHNR YAFDRFWRDT
RTHSLHDPLA HKLLEVGEYA LNGQYPPVRA YT