Gene Namu_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1814 
Symbol 
ID8447419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1989657 
End bp1991225 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content67% 
IMG OID645040943 
Productpermease for cytosine/purines uracil thiamine allantoin 
Protein accessionYP_003201193 
Protein GI258652037 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.627915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.010101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AGCAGGTGAC CGACACCCCC GTCGCGGGAG CAGTCGCCGA GCACCATCAT 
CTGTCCATGC ACGACATCAC CCCGGACCAT CCCGCCGGCG CGGGGGTGAT CAAGCCCGGT
TACGACGACC GGCTCACCAA CGAGGACCTG GCCCCGCTGC GCAAGCAGAC CTGGGGTTCC
TACAACTTCT TCGCTTTCTG GATGTCCGAC GTGCACAGCG TCGGCGGGTA CGTCACCGCG
GGCAGCCTGT TCGCCCTGGG CCTGGCCGCC TGGCAGGTGC TGGTCGCCCT GCTGGTCGGC
ATCACCATCG TGTATTTCCT GTGCAACCTG GTGGCCCGGC CCTCGCAGGC CACCGGCACT
CCCTACCCGG TCGCCTCCCG GATCTCCTTC GGCGTGCTCG GGGCGAACAT CCCCGCGATT
ATCCGCGGCC TCATCGCGGT GGCCTGGTAC GGCATCCAGA CCTACCTGGC CTCGGTGGCG
CTGGTCCTGC TGGCGATCAA GCTCTGGCCC GGGCTCGCGC CCTACGCCGA GACCGCGCAG
CACGGTTTTG CCGGGTTGTC GCTGCTGGGC TGGATCGGCT TCATGATCAT GTGGGTCGCC
CAGGCCGTCG TCTTCTGGCG GGGCATGGAG GCCATCCGCA AGTTCATCGA CTTCTGCGGA
CCCGCCGTGT ACGTGGTCAT GTTCGCGCTG GCCATCTACC TGGTCGCCGC GGCCGGCTGG
GAGAACATCG ACTTCAACCT GGCCGAGGGC GGCTTGACCC TGACCGGCTG GGCCGTCATC
CCCGTGCTGC TGTCCGCGAT CGCCCTCGTC GTCTCCTACT TCTCGGGCCC GATGCTGAAC
TACGGCGACT TCGCCCGCTA CGGCAAGTCG TTCGGCGCGG TCAAGAAGGG CAACTTCCTG
GGTCTGCCGG TCAACTTCCT GGTCTTCTCG CTGCTGGTCG TGGTGACCGC GGCGGCCACC
CGGCCGGTGT TCGGCGAGCT GATCATCGAT CCGGTGCACA CCGTGGCCCG GCTGGACAAC
GTCTACGCCG TCATCCTGGG CGCGCTGACC TTCATGATCG CCACCGTCGG CATCAACATC
GTGGCCAACT TCGTCTCCCC CGCCTTCGAC TTCTCCAACG TCAACCCGCA GAAGATCTCC
TGGCGGATGG GCGGCATGAT CGCCGCGATC GGCTCCGTGC TGATCACCCC GTGGAACCTG
TACAACTCGC CGCAGACCAT CCACTACACG CTGGACATCC TGGGCGCCTT CATCGGCCCG
CTGTACGGCG TCCTGATCGC CGACTACTAC CTGGTCAAGC GGCGTCGGGT GAACGTGGAC
GCGCTGTACA CCCTGAGCCC GAACGGCACC TACCACTACC GCAAGGGCTA CAACCCGGTC
GCCGTCGTGG CCACCGCGGT CGCCGCCCTG GCCGGTGTGC TGGTCGTCTT CTTCGCCTCC
ACCGAGGCCG CGACCTACAC CTGGTTCATC GGCGCCGGGC TGGGCTTCGT CCTCTACATG
GTCGGCAGCA AGCTGTTCTC GGTGCAGGCC AACTACCCGA CGGCCGAGCA GATGGGCACC
GCCGCCTGA
 
Protein sequence
MTDKQVTDTP VAGAVAEHHH LSMHDITPDH PAGAGVIKPG YDDRLTNEDL APLRKQTWGS 
YNFFAFWMSD VHSVGGYVTA GSLFALGLAA WQVLVALLVG ITIVYFLCNL VARPSQATGT
PYPVASRISF GVLGANIPAI IRGLIAVAWY GIQTYLASVA LVLLAIKLWP GLAPYAETAQ
HGFAGLSLLG WIGFMIMWVA QAVVFWRGME AIRKFIDFCG PAVYVVMFAL AIYLVAAAGW
ENIDFNLAEG GLTLTGWAVI PVLLSAIALV VSYFSGPMLN YGDFARYGKS FGAVKKGNFL
GLPVNFLVFS LLVVVTAAAT RPVFGELIID PVHTVARLDN VYAVILGALT FMIATVGINI
VANFVSPAFD FSNVNPQKIS WRMGGMIAAI GSVLITPWNL YNSPQTIHYT LDILGAFIGP
LYGVLIADYY LVKRRRVNVD ALYTLSPNGT YHYRKGYNPV AVVATAVAAL AGVLVVFFAS
TEAATYTWFI GAGLGFVLYM VGSKLFSVQA NYPTAEQMGT AA