Gene Avi_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1374 
SymbolpepN 
ID7389110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1152801 
End bp1155479 
Gene Length2679 bp 
Protein Length892 aa 
Translation table11 
GC content59% 
IMG OID643650781 
Productaminopeptidase N 
Protein accessionYP_002548987 
Protein GI222148030 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACAG AAACGGGCCA GGTTGTCAGC CTGGCGGATT ATCGCCCGAC CGAGTTTGTC 
CTGGAGCGAG TGGACCTGAC TTTCGAGCTT GATCCAACAG ACACAAAGGT CGAGGCCCGC
CTGATCTTCC ATCGTCGTGA AGGTGCCGAT GTCGCCGCAC CGCTGGTGCT GGACGGCGAC
GATCTGGTTC TCTCCAGCGT GCTGTTCGAC CAGATCGAAC TGGAGCCGGA GCGCTACAGC
GCCACCGCGC GCTCATTGAC GATCCGCGAC CTTCCTGCTG CCGAACCCTT TGAGATCACC
ATTACCACGC TGATCAATCC TGAGGCCAAT ACCCAGTTGA TGGGGCTTTA CCGCTCGAAC
GGCATCTATT GCACCCAGTG CGAGGCCGAG GGCTTCCGCC GCATCACCTA TTTCCCGGAT
CGACCGGACG TGCTGTCGGT CTATACCGTC AATATCATCG CTGACAAGCA AGCCAATCCG
CTGCTGTTGT CGAACGGCAA TTTCCTGGGT GGCGCGGGCT ATGGCGAGGG CAAGCATTTT
GCCGCATGGT TTGATCCCCA TCCCAAGCCC AGCTACCTGT TTGCACTGGT GGCAGGCGAT
CTCGGCCTGA TCGAGGACAC GTTCACCACG GTGTCGGGCC GCGAGGTCGC GTTGAAAATC
TATGTGGAGC ACGGCAAGGA ACCACGCGCC GCCTACGCCA TGGATGCGCT GAAGCGCTCG
ATGAAATGGG ATGAGGATGT ATTCGGGCGC GAATACGATC TGGATATTTT CATGATCGTC
GCCGTGTCGG ATTTCAACAT GGGGGCCATG GAAAACAAGG GCCTGAATGT CTTCAACGAC
AAATATGTGC TGGCCGACCC GCAAACCGCC ACCGATGCCG ACTATGCCAA TATCGAAGCG
ATCATTGCCC ACGAATATTT CCATAACTGG ACCGGCAACC GCATCACTTG CCGCGACTGG
TTCCAACTCT GCCTGAAGGA AGGGCTGACA GTCTATCGCG ATCACCAGTT TTCCGCCGAT
CAGCGCTCAC GCGCCGTCAA GCGCATCGCC GAAGTCCGGC ATTTGCGCGC CGAACAGTTT
GTGGAGGATT CCGGACCGCT CGCGCATCCG GTACGCCCGA ATACCTACAA GGAAATCAAT
AACTTCTACA CAACCACCGT CTATGAAAAA GGCTCTGAAG TCACCGGGAT GATCGCCACC
ATCCTCGGGC CGGACCTGTT CAAGGCTGGG ATGGATCTCT ACTTCGAGCG TCATGACGGC
GATGCGGCAA CGGTCGAGGA TTTCGTTGCC TGTTTTGCGG AGGTCTCCGG GCGCGACCTT
ACCCAGTTCT CTCTCTGGTA TAATCAGGCC GGTACGCCGA ATGTGACGGT ATCCTGCGAC
TACGATGCTG GCACGAGCCT GTTTACTGTC GAGTTGGAAC AAGTGATCCC GCCGACACCC
GGCCAACCCA ACAAGCAGCC GATGCATATT CCGCTGCGCT TTGGGCTTCT GGCGGCAGAT
GGCACACCAC TCGATACAGC AACCGTGGAC GGTGGTGAAA TCAGCGGCGA CGTGCTGCAT
CTGACCGAAC GTCGGCAGGT ATTCCAATTT TCAGGCGTGA AGGAACGTCC GGTCCTGTCG
CTCAACCGGG GCTTTTCGGC ACCCGTCATC CTGCATTTCA GCCAGGCAAG TGCGGATCTG
GCGCTGATCG CCCGCCACGA TAGCGACCTG TTCTCCCGCT GGCAGGCGCT GACCGACCTC
GCCTTGCCGG TTCTATGCGA CAACGCGCGC CAACTCTCCA CCAAGAGCGG CACAGCCAAG
AGCGCAACCG ATAAAGCATC CATCGCGGCA AGCGATGCCC TGAAACAAAG CCTGCTTGCC
ATCATTGCCG ACGATGCGCT TGAGCCTGCC TTCCGTGCCC AGGCGCTGGC GCTACCGAGC
GAAGCCGATA TCGCCCGCGA ACTGGGCAGT GACATCGACC CGGACGCCAT CCATCAGGCA
CGCCAGGCGG TGCTGGCTGA TATCGCCATT GAGGGCGCGG ATCTATTTGC CCGCCTTTAC
GATAACATGG CGACTGAGAC CCCCTACAGT CCGGATGCTA CTGCCGCAGG CAAAAGAGCG
CTGAAAAATG CCGCGCTTGG CTATCTCGTT CAGGCTAAAG GCGAGCCGGC CAAGGCCGCC
GAGGCCTATG GCAAAGCCGA CAATATGACC GACCTGTCGC ATGCGCTCGG CGTTCTCGCC
TATCACTTCG GCGATACGGA AGAGGCGCAG GCTGCACTGG CCAATTTCCA GACACGGTTT
GCCCAGAACG CATTGGTACT GGACAAGTGG TTTTCCATCC AGGCCACCAT TCCGGGACAC
GGTGCGCTGG AACGGATTGA AGCACTCATG CAAAATCCGC TGTTCAACGC CAGCAATCCG
AACCGCGTTC GCGCGCTGAT CGGCAGCTTT GCCTTTTCCA ACCCGACCGG CTTCCACCGC
GCCGATGGCA AGGCCTATAA CTTTCTTGCC GAGGAAATTC TGGCCATCGA CAAGCGCAAT
CCGCAGCTTG CCGCCCGGCT GCTGACCTCG ATGCGCACCT GGCAAAAGCT TGAGCCCGTT
CGTGCAGCCA AGGCCAGGGC GGCCCTTGCC CTTATCGAAA GCTCAAACGG CCTTTCCAAC
GATGTCCGAG ATATTGTTGA GCGGATGCTG AAAGGCTGA
 
Protein sequence
MRTETGQVVS LADYRPTEFV LERVDLTFEL DPTDTKVEAR LIFHRREGAD VAAPLVLDGD 
DLVLSSVLFD QIELEPERYS ATARSLTIRD LPAAEPFEIT ITTLINPEAN TQLMGLYRSN
GIYCTQCEAE GFRRITYFPD RPDVLSVYTV NIIADKQANP LLLSNGNFLG GAGYGEGKHF
AAWFDPHPKP SYLFALVAGD LGLIEDTFTT VSGREVALKI YVEHGKEPRA AYAMDALKRS
MKWDEDVFGR EYDLDIFMIV AVSDFNMGAM ENKGLNVFND KYVLADPQTA TDADYANIEA
IIAHEYFHNW TGNRITCRDW FQLCLKEGLT VYRDHQFSAD QRSRAVKRIA EVRHLRAEQF
VEDSGPLAHP VRPNTYKEIN NFYTTTVYEK GSEVTGMIAT ILGPDLFKAG MDLYFERHDG
DAATVEDFVA CFAEVSGRDL TQFSLWYNQA GTPNVTVSCD YDAGTSLFTV ELEQVIPPTP
GQPNKQPMHI PLRFGLLAAD GTPLDTATVD GGEISGDVLH LTERRQVFQF SGVKERPVLS
LNRGFSAPVI LHFSQASADL ALIARHDSDL FSRWQALTDL ALPVLCDNAR QLSTKSGTAK
SATDKASIAA SDALKQSLLA IIADDALEPA FRAQALALPS EADIARELGS DIDPDAIHQA
RQAVLADIAI EGADLFARLY DNMATETPYS PDATAAGKRA LKNAALGYLV QAKGEPAKAA
EAYGKADNMT DLSHALGVLA YHFGDTEEAQ AALANFQTRF AQNALVLDKW FSIQATIPGH
GALERIEALM QNPLFNASNP NRVRALIGSF AFSNPTGFHR ADGKAYNFLA EEILAIDKRN
PQLAARLLTS MRTWQKLEPV RAAKARAALA LIESSNGLSN DVRDIVERML KG