Gene Namu_4164 details


Gene Information

Locus tag: Namu_4164
Symbol:
ID: 8449790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4599586
End bp: 4601199
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 69%
IMG OID: 645043213
Product: amino acid permease-associated region
Protein accession: YP_003203442
Protein GI: 258654286
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
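
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the gene record.
start_bp = 4599586
end_bp = 4601199

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1614 0 537
```

Under these assumptions the result agrees with the listed gene length (1614 bp) and protein length (537 aa).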


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.696427
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.49431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCT GGCGCACCAA GTCTGTCGAG CAGTCGATCC GCGACACCGA TGAACCCGGG 
CACAAGCTCA AGCGGAGCCT GAGCGCCTGG GACCTGACCA TCTTCGGCGT GGCCGTGGTG
ATCGGCGCCG GCATCTTCAC CCTCACCGCG CGGGTGGCCG CGACCACGGC CGGCCCGGCG
GTGTCCCTGT CGTTCGTGAT CGCCGCCATC GCCTGCGGCC TGGCCGCCAT GTGTTACGCC
GAGTTCGCCT CGACGGTGCC GGTGGCCGGG TCGGCCTACA CGTTCTCGTA CGCCACCCTC
GGTGAGCTGG TCGCCTGGAT CATCGGCTGG GACCTGGTGC TGGAGCTGGC CCTGGGCGCG
TCGGTGGTGG CCAAGGGCTG GTCGCTGTAT CTGGGCAACC TGTTCTCCCA GTTCGGCGGC
TCGCTGCAGA CCACGATCGA GCTCGGCCCG CTGGACTTCG ACTGGGGCGC GGTGGTGATC
GTTGGTCTGA TCACGTTCGT GCTGGTGCAG GGCACCAAGC TCTCGGCCCG GGCCAACATG
ATCATCACCG CGATCAAGGT GTTCGTGGTG CTGCTGGTGA TCGTGGTCGG CTTCTTCTAC
TTCAACGCCA GCAACCTCTC GCCGTTCGTG CCGCCGAGCC AACCGGCGGC GGAGGGCGGC
AAGACCGGGC TGGAGCAGCC GCTGCTGCAG CTGATCCTGG GCGGCGCGCC GTCCTCGTTC
GGGTGGTTCG GGGTGCTGGC CGCGGCCTCG CTGGTGTTCT TCGCCTTCAT CGGCTTCGAC
ATCGTGGCCA CCGCGGCCGA GGAGACTCGC AACCCGCGCA AGGACCTGCC GCGCGGCATC
CTGGGCTCGC TGGCCATCGT CACCGTGCTG TACGTGCTGG TCTCGCTGGT GCTGACCGCG
ATGGTGCCCT ACGACCAGCT CGGGCCGCTG CAGCCGGACG GCACCCACGA CGGCAACGCC
GCGACCCTGG CCACGGCGTT CTCCGCGGTG GGCGTCGACT GGGCGGCCAA CGTCATCGCC
ATCGGCGCGC TGGCCGGGCT GACCACGGTG GTGCTGGTGC TGCTGCTCGG GCAGAGCCGC
ATCATCTTCG CGATGAGCCG GGACGGCCTG CTGCCGCGCG GAATGGCCAC GGTCAGCGAG
AAGACCGGAA CGCCGGCCCG GATCACCGTC GGGGTCGGCG TGGTCGTCGC CGTCATCGCC
GGTTTCTCCG AGATCGGCGT GCTCGAGGAG ATGGTCAACG TCGGCACCCT GTTCGCCTTC
GTCCTGGTCT CCATCGGCGT GATCGTGCTG CGCCGCACCC GGCCGGACCT GGAGCGCTCG
TTCAAGGTGC CGCTCATGCC GGTGCTGCCG ATCCTGTCGG TGCTGGCCTG CGTGTGGTTG
ATGATCAATC TCACCGCGAT CACCTGGATC CGGTTCCTGG TCTGGATGGC CCTGGGGGTG
GCCGTCTACT ACCTCTACGG CAAGAAGCAC TCCATGGTGG GGCGCCGGGC CGGGGACGGC
CTGGCCCTGA CCGAGGAGGA GCTCAAGGCC ACCTGGCGGG CCGAGGAGGA CTCCGGCTGG
GTCGGGCCGG ACCGAGGGCG GCGGCGTCGT GAGGAGAACG AACCCGGCAG CTGA
 
Protein sequence
MSVWRTKSVE QSIRDTDEPG HKLKRSLSAW DLTIFGVAVV IGAGIFTLTA RVAATTAGPA 
VSLSFVIAAI ACGLAAMCYA EFASTVPVAG SAYTFSYATL GELVAWIIGW DLVLELALGA
SVVAKGWSLY LGNLFSQFGG SLQTTIELGP LDFDWGAVVI VGLITFVLVQ GTKLSARANM
IITAIKVFVV LLVIVVGFFY FNASNLSPFV PPSQPAAEGG KTGLEQPLLQ LILGGAPSSF
GWFGVLAAAS LVFFAFIGFD IVATAAEETR NPRKDLPRGI LGSLAIVTVL YVLVSLVLTA
MVPYDQLGPL QPDGTHDGNA ATLATAFSAV GVDWAANVIA IGALAGLTTV VLVLLLGQSR
IIFAMSRDGL LPRGMATVSE KTGTPARITV GVGVVVAVIA GFSEIGVLEE MVNVGTLFAF
VLVSIGVIVL RRTRPDLERS FKVPLMPVLP ILSVLACVWL MINLTAITWI RFLVWMALGV
AVYYLYGKKH SMVGRRAGDG LALTEEELKA TWRAEEDSGW VGPDRGRRRR EENEPGS
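
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It is not part of the original record and makes several assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the sequence is read in frame from its first base, and alternative start codons are not treated specially. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so the standard codon assignments are used here. Only the first line of the gene sequence is used as input, so the GC fraction it reports covers those 60 bases and need not match the 69% given for the whole gene.

```python
def clean(seq_text):
    """Remove the spaces and line breaks used for 10-letter display blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (expects a non-empty sequence)."""
    return sum(base in "GC" for base in dna) / len(dna)


# Standard codon assignments, listed in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGTCCGTCT GGCGCACCAA GTCTGTCGAG CAGTCGATCC GCGACACCGA TGAACCCGGG"
dna = clean(first_line)

print(translate(dna))  # prints: MSVWRTKSVEQSIRDTDEPG
print(round(gc_fraction(dna), 3))
```

The translated fragment corresponds to the first two blocks of the protein sequence listed above. Passing the full gene sequence through `clean` and `translate` in the same way would be the natural next step for checking the complete record.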