Gene Avin_11650 details


Gene Information

Locus tag: Avin_11650
Symbol: pepA
ID: 7760107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1119154
End bp: 1120644
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 68%
IMG OID: 643804067
Product: leucyl aminopeptidase
Protein accession: YP_002798369
Protein GI: 226943296
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
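
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record; variable names are illustrative only.
start_bp = 1119154
end_bp = 1120644

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1491
print(protein_length_aa)  # 496
```

Both results agree with the Gene Length and Protein Length fields listed in the record.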


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACTCG TTGTCAAAAG TACCAGCCCG CAAACCCTGA AAACTGCAAC GCTGGTGGTT 
GCCGTCGGCG AAGGCCGCAA ACTGGGCGCC ACCGCCAAGG CCATCGACCA GGCCGCCGAC
GGCGCCCTGT CGGCCGCCCT CAAGCGCGGC GACCTTGCCG GCAAGGTCGG ACAGACCCTG
CTGCTGCACG CGGTGCCGAA CCTCAAGGCG GAGCGCGTGC TGCTGGTCGG CGCCGGCAAG
GAGGGCGAGT TGAGCGACCG CCAGTTCCGC AAGATCGCCG CTGCAACCTA CGGCGCGCTG
AAGGGCCTGG GCGGCAGCGA TGCCGCCCTG ACCCTGGGCG AACTCCAGGT CAAGGGACGT
GACACCTATG GCAAGACCCG CCTGCTGGCC GAGACCCTGC TCGATGCCAC CTACGCGTTC
GATCGCTTCA AGAGCGAGAA GGCCTCCGCG CCGGTCCTGA AGAAGCTCGT CCTGCTCTGC
GACAAGGCCG GCCAGGCCGA GGTGGAGCGC GCCGCCAGCC ATGCCCAGGC GATCGTCGAC
GGCATGGCCC TGACCCGCGA CCTCGGCAAC CTGCCGCCGA ACCTCTGCCA TCCGACCTCC
CTGGCCAGCG AGGCCAAGGC GCTGGCCAAG ACCTACGACA CCCTGAAGGT CGAAGTCCTC
GACGAGAAGA AACTCAAGGA GCTCGGCATG GGCGCCTTCC TCGCCGTGGC CCAGGGCAGC
GACCAGCCGC CACGGCTGAT CGTGCTCGAC TACCAGGGCG GCAAGAAGGA CGAGCAACCC
TTCGTGCTGG TCGGCAAGGG CATCACCTTC GACAGCGGCG GCATCAGCCT CAAGCCGGGT
TCGGGCATGG ACGAGATGAA GTACGACATG TGTGGCGCCG CCAGCGTGCT CGGCACCTTC
CGCGCCCTGC TCGAACTGGC GCTGCCGATC AACGTCGTGG GCCTTCTGGC CTGCGCCGAG
AACATGCCCA GCGGCGGCGC CACCCGCCCC GGCGACATCG TCACCAGCAT GAGCGGGCAG
ACCGTGGAGA TCCTCAACAC CGACGCCGAA GGCCGTCTGG TGCTGTGCGA CGCCCTCACC
TACGCCGAAC GCTTCAAGCC GCAGGCGGTG ATCGACATCG CCACCCTCAC CGGCGCCTGC
ATCACCGCCC TGGGCACCCA GGCCTCGGGC CTGATGGGCA ACGACGACGA CCTGATCCGC
CAGGTCCTCG AGGCCGGCGA ACATGCCGCC GACCGCGCCT GGCAGTTGCC GCTGTTCGAG
GAATACCAGG AGCAGCTCGA CAGCCCGTTC GCCGACATGG CCAATATCGG CGGCCCCAAG
GCCGGCACCA TCACCGCCGC CTGCTTCCTC TCGCGCTTCG CCAAGAACTA CCACTGGGCG
CACCTGGACA TCGCCGGCAC GGCCTGGATC AGCGGCGGCA AGGAAAAGGG CGCCACCGGC
CGCCCGGTAC CGCTGCTGAC CCAGTTCCTG CTGGACCGCA GCGCCCCCTG A
 
Protein sequence
MQLVVKSTSP QTLKTATLVV AVGEGRKLGA TAKAIDQAAD GALSAALKRG DLAGKVGQTL 
LLHAVPNLKA ERVLLVGAGK EGELSDRQFR KIAAATYGAL KGLGGSDAAL TLGELQVKGR
DTYGKTRLLA ETLLDATYAF DRFKSEKASA PVLKKLVLLC DKAGQAEVER AASHAQAIVD
GMALTRDLGN LPPNLCHPTS LASEAKALAK TYDTLKVEVL DEKKLKELGM GAFLAVAQGS
DQPPRLIVLD YQGGKKDEQP FVLVGKGITF DSGGISLKPG SGMDEMKYDM CGAASVLGTF
RALLELALPI NVVGLLACAE NMPSGGATRP GDIVTSMSGQ TVEILNTDAE GRLVLCDALT
YAERFKPQAV IDIATLTGAC ITALGTQASG LMGNDDDLIR QVLEAGEHAA DRAWQLPLFE
EYQEQLDSPF ADMANIGGPK AGTITAACFL SRFAKNYHWA HLDIAGTAWI SGGKEKGATG
RPVPLLTQFL LDRSAP
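
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a rough sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The function names are hypothetical. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for codon-to-amino-acid assignments (the two tables differ in permitted start codons, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGCAACTCG TTGTCAAAAG TACCAGCCCG CAAACCCTGA AAACTGCAAC GCTGGTGGTT"
print(translate(first_line))  # MQLVVKSTSPQTLKTATLVV
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields of the record.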