Gene Avin_21000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21000 
Symbol 
ID7761025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2095267 
End bp2097081 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content71% 
IMG OID643804995 
ProductCysteine desulfurase, SufS-like protein 
Protein accessionYP_002799276 
Protein GI226944203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGATC TGGCGCGGCT GGCCAACGCC TTCTTCGCCA GCCTGCCGGG TAGCGCTCCG 
TCGGACGGCG TTCCGTCGGG CGGCGCGCCC TTCGGCAAGG GTCTGTCGGA CAGCGTTCTG
CCGGGCGGCG TTTCGGCGCT GGCCGGCGTG TCGCCCGTCG AGCCGTTTCC GAGTGGGGCC
GCCGCCTCGC TCGGGGCGGC CGAACGGCAC GCCGGGGAGA CGCTTCCGGC GGGCATCGCC
GACGGCCTGG AGCTCGGCTC GCCCGAGGCC TACGCCGCCG CGTTGCCGAC GCTGTTCCCG
GCTGCCGGCG GCCTCGCGCC GTTGGCCGGG GGCGCGCCGT CCACGCCGTA CTACTTCCTC
GGCGAGGGCA GCGCCTACAG CGGCGAGCCG GAGCGCTTCG CGGACCTTCC CGTCGAGCCG
GACAGCCGCG CGGTGCCGGG CGGGGACGCG CTGGGCGAGG TCCTCCGCTC GATCCTCGCC
GAGCCGTCCG CGCCGGCGGC GCCGGCGGGT CCGGGGGCCG GACAGTTCTA TTTCCTCGAA
CGGAGCGGTC CGGGCCTGGA GCAGGCGCCC GATCCCGTCG TCCAGCCGCA GGTCCGCAGC
GGTTTCGACG TGCAGGCGGT GCGCCGGGAC TTCCCGATTC TCGCCGAACG GGTCAACGGC
AAGCCGCTGG TGTGGTTCGA CAACGCCGCC ACCACGCAGA AGCCCAAGGC GGTGATCGAT
CGCCTGGCGT ACTTCTACGA GCACGAGAAC TCCAACATCC ACCGCGCCGC CCACGAGCTG
GCGGCGCGGG CGACGGACGC CTACGAGGGC GCGCGCAGCA AGGCGGCGCG CTTTCTCGGC
GCGAAGTCGA CGGACGAGAT CGTCTTCGTG CGCGGGGCGA CCGAGGGCAT CAACCTGCTG
GCCAACACCT TCGGCCGCCG GTTCATCGGC GAGGGGGACG AGATCATCGT CTCCCACCTG
GAGCACCACG CCAACATCGT GCCCTGGCAG TTGTTGGCCA ACGCGGTGGG CGCCAGGCTC
AAGGTGATCC CGGTGGACGA CTCCGGGCAG ATCATCCTGG AGGAGTACGC CAGGCTGCTC
GGTCCGCGCA CCCGGCTGGT GAGCATCACC CAGGTGTCCA ACGCCCTCGG CACGGTCACC
CCGGTGGCGG AGGTGATCGC CCTGGCGCAT GCCGCCGGTG CGCGGGTACT GGTGGACGGC
GCGCAGTCGG TGTCGCACCT GAAGGTCGAC GTGCGGGCGC TGGACGCGGA CTTCTTCGTG
TTCTCCGGGC ACAAGGTCTT CGGGCCGACC GGCATCGGTG TGGTCTACGG CAAGCAGGAA
CTGCTCGACG AGCTGCCGCC CTGGCAGGGC GGCGGCAACA TGATCGCCGA TGTGACCTTC
GAGAAGACCC AGTACCAGGG TGCGCCGGCG CGCTTCGAAG CCGGTACCGG CAACATCGCC
GATGCGGTGG GGCTCGGTGC GGCGCTGGAC TATGTCGAGC GCATCGGCCT GGAAGCCATC
GCCCGCTACG AGCACGAACT GCTGGAGTAC GCCACCCGGG GCCTCGCGAC GATTCCCGGC
CTGCGCCTGA TCGGTACCGC GGCGAACAAG GCCAGCGTGC TGTCCTTCGT GCTGCAGGGC
TACCGCACCG AGGAGATCGG CGCCGCCCTC AATCGCGAGG GTGTCGCCGT GCGTTCCGGC
CACCACTGCG CGCAGCCGAT CCTGCGCCGT TTCGGGGTGG AAACCACGGT GCGGCCGTCG
TTGGCGTTCT ACAACACCTT CGACGAGATC GACCTGCTGG TGAGCACCGT GCACCGTCTG
GCGACCCGGC GCTGA
 
Protein sequence
MADLARLANA FFASLPGSAP SDGVPSGGAP FGKGLSDSVL PGGVSALAGV SPVEPFPSGA 
AASLGAAERH AGETLPAGIA DGLELGSPEA YAAALPTLFP AAGGLAPLAG GAPSTPYYFL
GEGSAYSGEP ERFADLPVEP DSRAVPGGDA LGEVLRSILA EPSAPAAPAG PGAGQFYFLE
RSGPGLEQAP DPVVQPQVRS GFDVQAVRRD FPILAERVNG KPLVWFDNAA TTQKPKAVID
RLAYFYEHEN SNIHRAAHEL AARATDAYEG ARSKAARFLG AKSTDEIVFV RGATEGINLL
ANTFGRRFIG EGDEIIVSHL EHHANIVPWQ LLANAVGARL KVIPVDDSGQ IILEEYARLL
GPRTRLVSIT QVSNALGTVT PVAEVIALAH AAGARVLVDG AQSVSHLKVD VRALDADFFV
FSGHKVFGPT GIGVVYGKQE LLDELPPWQG GGNMIADVTF EKTQYQGAPA RFEAGTGNIA
DAVGLGAALD YVERIGLEAI ARYEHELLEY ATRGLATIPG LRLIGTAANK ASVLSFVLQG
YRTEEIGAAL NREGVAVRSG HHCAQPILRR FGVETTVRPS LAFYNTFDEI DLLVSTVHRL
ATRR