Gene Namu_4314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4314 
Symbol 
ID8449940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4796892 
End bp4798712 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content66% 
IMG OID645043362 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003203591 
Protein GI258654435 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.986199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGAA CTCGGGCGGT GGCCATCACC GCGCTCGCCG CAGCCACCAT GGTGGTGGTC 
GCAGCGTGTG GGAGCAGCAG CAGTAGCTCC AGCACGACGA GCGCAACGTC GGGCGCCACC
AGCGCGTCGG GCAGTTCTGC GGCCGGCGGT GGCACCTCGG GGGACTCCGG GACGATCACC
TACGCGCAGG AGCAGGAGTG GTACGACTAC AACTCCGGCT CCAGCTCGGG CAACGCCACG
GCCAACCAGG TGGTGCTCAA CCAGGTCCTG CGTGGGTTCT CCTTCGTCGA CAACACCGGC
ACCGTGCAGA TGGACGACGA GTACGGGACC ATCGAGAAGC TCTCGGACAG CCCGCTGACC
GTCAAGTACA CCTTCAACGA CAAGGCCGTC TGGTCGGACG GTCAGCCCGT CGGCTGTGCG
GACTTCCTGC TGGCCTGGGC GGCCAACTCG GGCCGCTACA ACAAGGACGG CACCATCAAC
CAGCCGGGGA CGATCCCGGC CGAGCCGGCT TACATCTTCG ATACCGCCTC CACCTCGGGC
ATCGACCAGA CCCAGAAGCC GACCTGTGCC GATGGTGACA AGTCGGTCAC GCTGACCTAC
GACAAGCCGT TCGTGGACTG GCAGGTCGCG ATCGGTACCT CGGCCGGCTC CAGCATCCTG
CCGGCGCACG TGGCGGCCAA GGCGGCCGGC ATCGACACCG CTGGCCTGGT CAAGGCGGTC
GAGACCGACG ACGTCGCCAC CCTGACCAAG GTTGCGGACT TCTGGAACAC CGGCTGGACC
CTGGACAAGG CCAAGGGCCT GCCGGACGCG ACCACCATTC CCTCGGCCGG CCCGTACTAC
GTGGCCGCCT GGGATCCGGG TCAGTCGGTG ACCCTGAAGA AGAACGACAA GTGGTGGGGC
ACGCCGGCCA AGACCGACAC GGTCGTGATC CGGTACATCT CGCAGGACCA GCAGGTCTCC
GCGCTGGCCT CCGGTGAGGT CGACGTGATC GAGCCGCAGC CGAACCCGGA CGTCAACGCC
GCCCTGAACA ACCTGGGCAA CGCGGTGAAG GCCGACTACG GCTCGCAGTT CACCTACGAG
CACATGGACC TCTCGGTCAA GAACGGCTTC GCCAACGAGA AGCTGCGCGA GGCCCTGTTC
AAGTGCGCCC CCCGGCAGCA GATCGTGGAC AACCTGATCG TGCCGTCGAA CCCGGACGCC
AAGTTGCTGA ACTCGCTGAT GCTGATGGAC TTCCAGCCCG GCTACGACCA GATCTCGCAG
GCCTCGGGCT TCGCCAACTA CGCCGATGTC GACATCGAAG GCGCCAAGGC GGCGTACGCG
GCCTCCGGCG AGGCGCAGGG CAAGACCATC CGGGTCATCC ACATCGATCC GAACCCGCGG
CGGACCAACG AGGTCGCGTT GCTCAAGGCC AGCTGCGACC CGGTGGGCTT CAACATCCAG
GACGTGCCGC TGTCCAGCGA CAAGTTCGGG CCGACCCTGT CCGCCGGTGA CTACGACATC
GCGCTGTTCG CGTGGGCCGG CTCGGGTCTG CTGGGCTCGA TCCCCTCGGA GTACCTGTCC
ACCGGTGGCC AGAACTACTC CGGCTGGAAC GACGCGCAGA TGGACCAGGC CCTCAACAGC
CTGGCCACGC TGACCGATAC CTCGAAGGCC CTTCCGCTGT TGACCACGGT CGACCAGCGG
CTGGCGGCCA ACTACTACTC CTTCCCGATC TTCACCTTCC CGGGAGTGGT GGCCATGAAG
TCGAACATCG AAGGTCCGGT GCTCAACGCC ACGCAGACCC AGGCCACCTG GAACATGCAG
GACTGGGCGC GCACCGGCTG A
 
Protein sequence
MRRTRAVAIT ALAAATMVVV AACGSSSSSS STTSATSGAT SASGSSAAGG GTSGDSGTIT 
YAQEQEWYDY NSGSSSGNAT ANQVVLNQVL RGFSFVDNTG TVQMDDEYGT IEKLSDSPLT
VKYTFNDKAV WSDGQPVGCA DFLLAWAANS GRYNKDGTIN QPGTIPAEPA YIFDTASTSG
IDQTQKPTCA DGDKSVTLTY DKPFVDWQVA IGTSAGSSIL PAHVAAKAAG IDTAGLVKAV
ETDDVATLTK VADFWNTGWT LDKAKGLPDA TTIPSAGPYY VAAWDPGQSV TLKKNDKWWG
TPAKTDTVVI RYISQDQQVS ALASGEVDVI EPQPNPDVNA ALNNLGNAVK ADYGSQFTYE
HMDLSVKNGF ANEKLREALF KCAPRQQIVD NLIVPSNPDA KLLNSLMLMD FQPGYDQISQ
ASGFANYADV DIEGAKAAYA ASGEAQGKTI RVIHIDPNPR RTNEVALLKA SCDPVGFNIQ
DVPLSSDKFG PTLSAGDYDI ALFAWAGSGL LGSIPSEYLS TGGQNYSGWN DAQMDQALNS
LATLTDTSKA LPLLTTVDQR LAANYYSFPI FTFPGVVAMK SNIEGPVLNA TQTQATWNMQ
DWARTG