Gene CHU_1442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1442 
SymbolpepN 
ID4186239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1668101 
End bp1671340 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content41% 
IMG OID638071435 
Productaminopeptidase N 
Protein accessionYP_678053 
Protein GI110637846 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.956828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGAAAT TAATTATGAG ATATACGTTT CTTGCCCTGT TGCTTCTGGC TTATTTCGTT 
TCTTTTTCAC AGGCCAGTCC TTTTCCAGCC AATACATTCC GTTCAAAGGC AAATAAAAAC
TATTGGAAAA ACAAAGCACC AAGAGCTGAC TACTGGCAGC AGGATGTGCA TTATACCATT
GATGCAACAC TGGATGATTC TCTGAATACC ATAACAGGGA ATTCTTACCG GCTTGTATAC
TGGAATAACT CTCCGGATGA ACTGAAGGAA TTATATTTCC ATCTTTTTCA GAATATGGCT
CAACCCGGTT CGTATTATGA AAATTTAAAT CAGAATAATA AAATTCCGAT CAAGTACGGC
AAATACGAGC AGGAAGGATT GGGTACAACT ACTGAGAACA TTCAGGTAGA CGGGCAAGCA
GTAAAACTTG AATTAGACAA TACGATACTG AAAATATTTC TGAATAAGCC GCTGCGAAGC
GGTGATTCAG TGGTCGTAAC CATGACGTTC AAAACGTATT ACGACAATGG CAGCATGCGC
CGCCGCATGA AATTCTATGA AACATTTAAC ACCAAACATT TTGACGGCGT GTTCTGGTAC
CCCTCTGTAT GTGTATATGA TTCTAAATTC GGATGGACAA CAGATCAGGA TCTGGACAAA
GAATTTTATC ACGACTTCGG TACGTTTGAT GTTGCATTAA CGTTTCCGCA GGAATATGTA
CTTGACGCCA CAGGAGTATT GCTAAATGAA AAAGAAGTAC TACCCGATTC GTTACGTGCA
AAACTCGACA TTAAAAACTT TGCAAAGAAA CCATTCAATG AAGCTCCATC CATCATTATT
CCCAAAGTTG CCGGTAAAAC AAAGACCTGG TACTTCCACG CGGAAAATGT ACACAACTTT
GCCTTCACAG CTGATCCGTT ATACCGCATT GGTGAAACAT CATGGAACGG TGTACGTGTA
ATAACACTGG CACAGGAACC GCATGCTTCT AAATGGCAGC AGTCTGGCTG GTATACCGCA
CAGATCATTA AAACATACTC CAATGATTTC GGCATGTATG ACTGGCCTAA GATTATTGTT
GCGGATGCAA AAGACGGGAT GGAATATCCA ATGCTTACCT TAGATAATGG TACGTATCCG
CAGCACCAGT ATTTGTTGTC GCATGAAGTC GGACACATGT GGTTTTATGG CATGGTTGGT
TCAAACGAAA CATACCGTGC CTTTATGGAC GAAGGCTTCA CGCAGTTTTT AACAGTATGG
TCAATGGATA AAATCCTGGG AGCAAAACGT GACCGCATCC ACCCTAATAA GCGTGTCGAC
AAATACATTG ATTCGGCTGA TAACAGGTAT GAAAATTTAT ATTATCCGTA CCTTAACCAT
GTTACAGAAA ATTTTGATGA ACCGATCAAC ACACATTCGT GTGCGTTTAA CGGAGCAGTA
AGGCATAGCG GTAATTATGG TTTGGTTTAT TACAAAGCCG GAACTATGTT ATACAATTTA
AAGTATGTAC TGGGCGATTC GCTTTTTATA GGAGCAATGA AACACTATGT GCAGAAATGG
AAATTCGCGC ACCCCTACCC CGAAGATTTC AGAGATGCGA TTATTGAATA TACACACGTT
GATCTGAACT GGTTCTTTGA TCAATGGCTT GAAACAACCA AGTACATTGA CTATAATGTT
AAATCGGTAA AGCATGTATC CGAAGCTGAC AATCAGCATA CGTATGCGCT TACGTTTGAA
CGGTTGGGCC GCATGAATAT GCCGCTAAAA TTTGTTGTTA TATGTAAAGA TAGTACAAAG
CAACATTATT TAATACCGAA CACCTGGTTT GTTCCCCCAA CAAAAGACAG TGTATTAAAA
AAATGGTATG GATGGGATCT GCTTCAGCCT ACGTATACAA CAGAAATTAA AACGTCTTCT
GAAATCAAGT CTGTGATTAT TGATCCGGAA ATGCTGCTTG CGGACATCAA TAATTCAAAC
AATGAATGGG GAAAAAAAAC AAACCGTACC TGGCAATTTG ACCACCGTGT TCCGACTGCT
AAAAACTGGA AGTACCGGAG AAATTATGTT CGTCCGGATG TTTGGTACAA TGGGTTTGAT
GGTGTTCAGC TTGGCGCACA TGCTGAAGGT AAATATTTCA ACCGGTATAA TTACCACATC
AGTGTTTGGG GCAATACCAC GCTTGGTCAG TATATACGGC AGCCAACAGA TCTCAAACCT
CAGTACATTG CTTTTAATGC AATGTATAAC CGTCAGTTAA GTTTGCTTTC AAAAGAACTA
TCCGCTAATA CACAATTGGC TTTCAATGCC GGTATATGGA AAGGTATAGT TGGTTTTGAA
AAAATATTCC GCACACAGGA TCAACGGAAC ACGAGATATA CCAGGCTTTC TGTATATGCA
AAATACCTGG TAAACGAATA TAATTATCAG CCTTATTTAT TATATCCTTC TGAATGGGGT
GTGCGGAATC AATCGACACA ATGGATTAAT GCGTCATTGA ATATTTCATT GTTACGCAAT
TACACGTATA CAAAAGGCAG CGGAACGGTA AGTATTGGCT TACGTGTACC ATTCATCGCA
AGTGATTACA ACTATTCACA GATTACCGGT GAAGCGAAAA ATACGTATGC GCTGAAAAAG
CTGGATCTAA AAACGCGCAT CTTTGCTCAG CTAGGGCTGA ATGATGTACC GCTTGAAAGT
TCGTTGTATG CAGCGGGTGC AAACCAGGAG CAACTGATCG ATAATAAATA CACGCGCGCG
ATTGGTTTTG TGCCGTACGA CTGGTTGAAT TATCAGAACG CAACCAACCA TTTTCAATAT
GGCGGCGGCT TAAATCTGAG AGGTTATGCA GGCAGTTATT GCGCAGAAAA AATAAGCACA
CCTACCGGCG ATAGTATTGT GTATGCATAC AATGGTCAAT CAGGCGGCGC CGTTAATCTC
GAACTGGATT TTGATAAGTA CATAAAAGTA AAAGCGAAAG GAATTACTAA GAACATCCAT
CTTGACATGT ATGGTTTCTT TGATGCAGGT ATCTTAAATT ATAATGTCGC AACAAAAAAA
TACTGGAGCA GCGTACGTAT GGATGCCGGC TTGGGAACAG CCATGACAAT AAAATTCACT
CCATACGACA TTACTCCGCT TACCATCCGG GCAGATTTCC CCTTGTGGAT CAACACACCT
GTTGACGGCA CAAATTATGC TGACTTCAGA TGGGTGTTGT CGGTGAACAG AGCTTTTTAA
 
Protein sequence
MSKLIMRYTF LALLLLAYFV SFSQASPFPA NTFRSKANKN YWKNKAPRAD YWQQDVHYTI 
DATLDDSLNT ITGNSYRLVY WNNSPDELKE LYFHLFQNMA QPGSYYENLN QNNKIPIKYG
KYEQEGLGTT TENIQVDGQA VKLELDNTIL KIFLNKPLRS GDSVVVTMTF KTYYDNGSMR
RRMKFYETFN TKHFDGVFWY PSVCVYDSKF GWTTDQDLDK EFYHDFGTFD VALTFPQEYV
LDATGVLLNE KEVLPDSLRA KLDIKNFAKK PFNEAPSIII PKVAGKTKTW YFHAENVHNF
AFTADPLYRI GETSWNGVRV ITLAQEPHAS KWQQSGWYTA QIIKTYSNDF GMYDWPKIIV
ADAKDGMEYP MLTLDNGTYP QHQYLLSHEV GHMWFYGMVG SNETYRAFMD EGFTQFLTVW
SMDKILGAKR DRIHPNKRVD KYIDSADNRY ENLYYPYLNH VTENFDEPIN THSCAFNGAV
RHSGNYGLVY YKAGTMLYNL KYVLGDSLFI GAMKHYVQKW KFAHPYPEDF RDAIIEYTHV
DLNWFFDQWL ETTKYIDYNV KSVKHVSEAD NQHTYALTFE RLGRMNMPLK FVVICKDSTK
QHYLIPNTWF VPPTKDSVLK KWYGWDLLQP TYTTEIKTSS EIKSVIIDPE MLLADINNSN
NEWGKKTNRT WQFDHRVPTA KNWKYRRNYV RPDVWYNGFD GVQLGAHAEG KYFNRYNYHI
SVWGNTTLGQ YIRQPTDLKP QYIAFNAMYN RQLSLLSKEL SANTQLAFNA GIWKGIVGFE
KIFRTQDQRN TRYTRLSVYA KYLVNEYNYQ PYLLYPSEWG VRNQSTQWIN ASLNISLLRN
YTYTKGSGTV SIGLRVPFIA SDYNYSQITG EAKNTYALKK LDLKTRIFAQ LGLNDVPLES
SLYAAGANQE QLIDNKYTRA IGFVPYDWLN YQNATNHFQY GGGLNLRGYA GSYCAEKIST
PTGDSIVYAY NGQSGGAVNL ELDFDKYIKV KAKGITKNIH LDMYGFFDAG ILNYNVATKK
YWSSVRMDAG LGTAMTIKFT PYDITPLTIR ADFPLWINTP VDGTNYADFR WVLSVNRAF