Gene Avin_21160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21160 
Symbol 
ID7761041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2108848 
End bp2110146 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID643805011 
ProductCondensation domain protein 
Protein accessionYP_002799292 
Protein GI226944219 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA CGCCTTCCCG CTTGCCCGCC ACCACCGCCC AGTTCAGCAT CTGGGTCGCC 
CAGCAGGCCG TACCCGACAG TCCGGCCTAC CTCACGGCCG AAACCATCGA GCTGCTCGGC
GCCGTCGATC GCGCGGCGCT GTCGGACACG GTGGTCGAGG TCCTGAACAA CTGCCACAGC
CTGCACATGC GCTTCGTCCA CGAGGAAGGC CGGCTGTGGC AATTGCCGCA GCCGCCGGCC
TGCCCGCCGC TGGCGCTGCA CGACTTCCGC CGCGAGCCCG ACCCGGCCGC CGCGGCCCAG
GCCTGGATCG ACCGGGAACT GAGCCGCACC CTCGACCCGA GCCGCGACGT GCTCTACCGG
AGCGCCCTGC TGCAACTGGG CGACCAGCAC CATCTCTGGT ACCTGCAGGC CCACCATGTG
ACGCTGGACG GCTACGCCTA CGGCCTGGTC TGCCAGACCA TCGCCGAGCG CTACAGCGCC
AGGGTCCGGC AAAGCGAGCT GCCGCCCCTG CCGGACTGGT CGATGGCGCG GGTGGTGCAG
GCCGAGCAGG ATTACCGGGC CAAGGGACTG TTCGAGCGCG ACAAGCAGTT CTGGCTGCAG
CAGCTGCGGG ATGTCCCGGC ACCGGCCACC ATCGCCGAGG CGGCGGAGTT TCCCGAGCAG
GTCATTCGCA GCGAAACCCG GCTGCAACGC GACCAGGTGG CCATGCTGCA GGAGGCGGCC
CGCGCCTGCG GGCAGGACTG GGGCAACTGG GTGCTGGCGG CCATCGGACT CTGGCTGGGC
AGGCAATCCG GCCAGCAGGC GCTGACCTTC GCCCTGCCGG CGATGAACCG GCTGGGCACC
CCGGCGCTGG GCGTACCCTG CATGGCGATG AACATAGTGC CGCTGAGCCT GCGCATCGAC
CCGGCCGGCT CGATGGCCAG CCATGCGCAG CAGATCGCCG GCGAGATGCG CCGCATCCGC
CCGCACCTGT ATTACCGCTA CGGCTGGATG CGCATGGACC TGGGCCTGAT GGAGGCGCAG
AAGCACCTGT TCAACCAGGC GGTCAACCTC ATGCCCTTCG ACCGCAAGGC CGCCTTCGCC
GGGCTGGACA GCCGCATCAG GCCCATCACC GCCGGACCGG TCAAGGACCT CAACGTCACC
CTGGCGGTAC TGGACGCCGA GTGGCGGCTC TGCGTCGAAG CCAATCCCCA CGCCTACTCC
AGCGACCGAC TGGAGCAATT GCACGGGGAC CTGCTGGACT GGCTGCAACG GCTGGCCAGG
CACGCCCCGG AGGCGCCCCT GCAGGCGCTG TGGCAATAA
 
Protein sequence
MNATPSRLPA TTAQFSIWVA QQAVPDSPAY LTAETIELLG AVDRAALSDT VVEVLNNCHS 
LHMRFVHEEG RLWQLPQPPA CPPLALHDFR REPDPAAAAQ AWIDRELSRT LDPSRDVLYR
SALLQLGDQH HLWYLQAHHV TLDGYAYGLV CQTIAERYSA RVRQSELPPL PDWSMARVVQ
AEQDYRAKGL FERDKQFWLQ QLRDVPAPAT IAEAAEFPEQ VIRSETRLQR DQVAMLQEAA
RACGQDWGNW VLAAIGLWLG RQSGQQALTF ALPAMNRLGT PALGVPCMAM NIVPLSLRID
PAGSMASHAQ QIAGEMRRIR PHLYYRYGWM RMDLGLMEAQ KHLFNQAVNL MPFDRKAAFA
GLDSRIRPIT AGPVKDLNVT LAVLDAEWRL CVEANPHAYS SDRLEQLHGD LLDWLQRLAR
HAPEAPLQAL WQ