Gene Namu_3910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3910 
Symbol 
ID8449529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4316298 
End bp4317560 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content71% 
IMG OID645042956 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003203192 
Protein GI258654036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.420808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.127498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCC GACGATTGAC CGGCCGGATC GCAGCCGGCA CCCTGGCCAC CGGCCTGCTG 
CTGGCCGCGT GTGGCAGCCC CGGTTCGTCG TCCTCGTCGC CGACCACGGC GGCCAGTGCC
GCCAGTGCGG CCGGCGGCGG TTCGGCCGGC GCCTCCGCGC AGGCGGCCAC CGGTGAACCG
ATCAAGGTCG GCGTCGTCAC CTCGCTGTCG GGGCCGCTGC AGTCCTACGG GCAGATGTAC
CTGGACGCCT TCAACGTGTG CCTGGACCAC GCCACCAACG GCACCGGCGC GGTGAACGGC
CGGCCGATCG CGGTGGCCAC CGCCGATGAC GCCGGCGATC CGGCCAAGGC AACCACCGCG
GCCACCGACT ACATCGGCCA GGGCTACCAG ATCCTGGCCG GGTCCGCCTC GTCCGGGGTC
GCCCTGCAGG TCGCCCCGCT GGCCCAGGAG AACCAGGTGC TGTTCATCTC CGGGCCGGCC
GCCACCGACG CCATCACCGG GGTCAACAAG TACACGTTCC GCTCGGGACG CCAGACGTAC
CAGGACATTG CGACCGCGGC GTCCTTCGTG GGCGATCTGC AGGGCAAGAA GGTGACGATC
TTCGCCCAGG ACAGCGCGTT CGGCCAGGCC AACGTGGCCG CGGCCTCGGC CGTCTTCGGC
GCCGAGGGGG CCACCGTCAC CCCGCTGCTG GTGCCGGCGA CCGCGACCGA CCTGGTGCCG
TTCGCCAAGC AGGCCGCCGA CGCCGACCCG GATCTGCTGT TCGTGGCCTG GGCCGGCACC
AACGCCACCC AGATGTGGGA GGCGATGGGC CAGCAGGGCG CGTTCGACGG CACCACCGTG
GTCACCGGTC TGGACATCAA GCCCACCCAC ACCGTTTTCG CTCCGGTCGC GGACAAGCTC
TCGCTGCTGG CCCACTACTT CGACGGCGCC ACCGACAACG AGGTGGAGCA GGCGCTGGTC
GCCGGGCTGA CCGCGGAAGG CAAGACGCAG GATCTGTTCT CGCCGGACGG CTGCAACGCG
GCGTTGATGG TGGTGCGGGC GGCGCAGGAG TCGCCGGACG ACGTGGACGG CATGATCACG
GCGCTGGAGG GCTGGGAGTT CGAGGGTCCC AAGGGCACCA CCACGATCCG GGCCGAGGAT
CACGCGATGC TGCAGCCAAT GTTCCAGACC AAGCTGGCCG ATGTGAACGG CACGCTGACC
CCCGAACTGG TCAAGGAGCT GGCACCGGCG GACACCGCCC CGGCCGCGAC GCCCTTCAAG
TGA
 
Protein sequence
MSRRRLTGRI AAGTLATGLL LAACGSPGSS SSSPTTAASA ASAAGGGSAG ASAQAATGEP 
IKVGVVTSLS GPLQSYGQMY LDAFNVCLDH ATNGTGAVNG RPIAVATADD AGDPAKATTA
ATDYIGQGYQ ILAGSASSGV ALQVAPLAQE NQVLFISGPA ATDAITGVNK YTFRSGRQTY
QDIATAASFV GDLQGKKVTI FAQDSAFGQA NVAAASAVFG AEGATVTPLL VPATATDLVP
FAKQAADADP DLLFVAWAGT NATQMWEAMG QQGAFDGTTV VTGLDIKPTH TVFAPVADKL
SLLAHYFDGA TDNEVEQALV AGLTAEGKTQ DLFSPDGCNA ALMVVRAAQE SPDDVDGMIT
ALEGWEFEGP KGTTTIRAED HAMLQPMFQT KLADVNGTLT PELVKELAPA DTAPAATPFK