Gene Namu_1151 details


Gene Information

Locus tag: Namu_1151
Symbol:
ID: 8446747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start (bp): 1270073
End (bp): 1271461
Gene length: 1389 bp
Protein length: 462 aa
Translation table: 11
GC content: 67%
IMG OID: 645040288
Product: sodium:dicarboxylate symporter
Protein accession: YP_003200547
Protein GI: 258651391
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
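
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a short calculation. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (neither is stated in the record). The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1270073, 1271461)
print(length)                  # 1389
print(protein_length(length))  # 462
```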


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCA GCAACACCCC GCCGACAAGC ACTGGTGTTC CCGTTCCCGA CAAGCCCAAG 
AAGAAGATCA GTACCCACTG GCTCTACATC GCGGTCATCG TCGCGGTGGT CGCCGGTGTC
GCGGTCGGTC TGATCTTCGG CAAGAGCGCC GCCGGCCTGT CGATCATCGG CACCGCGTTC
GTCAACCTGA TCAAGATGAT GATCGCGCCG ATCATCTTCT GCACGATCGT GCTGGGCATC
GGCTCGGTCC GCAAGGCGTC CCAGGTCGGC AAGGTCGGCG GCCTGGCCCT GCTCTACTTC
ATCGTCATGT CGACGTTCGC CCTCGGGCTG GGCCTGCTGG TGGGCAACCT GGTCCACCCC
GGCGACAGCC TGCAGGCCGC CGCGGCGACC GCGACCTACA CCGTGCCGGC CGCGGCCGAG
AGCTCGGGCA ACTTCATCAT CGACATCATC CCGCACTCCC TGCTGGGCGG GCTGACCGAG
GGCAACGTGC TGCAGGCCCT GTTCGTGGCC CTGCTGGTCG GCTTCGCGGT GCAGGCCCTG
GGCAGCAAGG GTGAGCCGAT CATCGGCGCC ATCACCGTCC TGCAGCGTCT GGTCTTCAAG
ATCCTGGCCG GCATCATGTG GCTGGCCCCG ATCGGTGCCT TCGGTGCCAT CGCCGGCGTC
GTCGGCAACG CCGGCTGGGC CGCCATCGGT GCGCTGTCGC TGTTCGTCGC CGTCTTCTAC
GCCACCTGCG TCGCCTTCAT CGTGGTCATC CTGGGCGGCC TGCTCAAGGT CACCACCGGC
CTGTCGATCT TCAGCCTGCT CAAGTACCTG CGCCAGGAGT ACCTGCTGAT CGTGGCGACC
AGCTCCTCCG AGACCGCCCT GCCCCGCCTC ATCGCCAAGA TGGAGCACCT GGGTGTGTCC
AAGCCCGTCG TCGGCATCGT GGTGCCGACC GGCTACTCGT TCAACCTGGA CGGCACCGCC
ATGTACCTGA CCATGGCCTC GCTGTTCATC GGCAACGCCA TGGGCACCCC GCTGACCTGG
AGCGAGCAGC TGTCCCTGCT GCTGTTCATG ATCATCGCTT CCAAGGGCGC CGCCGGGGTC
ACCGGCGCCG GCCTGGCCAC CCTGGGTGGC GGCCTGGCCT CGCACCGCCC CGACCTGGTC
CCCGGTGTCG GCCTGATCGT CGCCGTCGAC CGGTTCATGT CCGAAGCCCG CGCCCTGACC
AACTTCTCCG GCAACGCCGT CGCCACCCTG GTCATCGCCC ACTGGACCAA GGAGGTCGAC
TACAGCCAGA CCAAGCGGGT CTTCACCGGT GAGGATCCCT TCGACGACGC CGACATGCTC
GACGAGCACT CGGCCGCCGA GCACCGGGCG GACGTCGAGT CCTACAAGAC CAAGGAACTG
GCCCACTGA
 
Protein sequence
MTTSNTPPTS TGVPVPDKPK KKISTHWLYI AVIVAVVAGV AVGLIFGKSA AGLSIIGTAF 
VNLIKMMIAP IIFCTIVLGI GSVRKASQVG KVGGLALLYF IVMSTFALGL GLLVGNLVHP
GDSLQAAAAT ATYTVPAAAE SSGNFIIDII PHSLLGGLTE GNVLQALFVA LLVGFAVQAL
GSKGEPIIGA ITVLQRLVFK ILAGIMWLAP IGAFGAIAGV VGNAGWAAIG ALSLFVAVFY
ATCVAFIVVI LGGLLKVTTG LSIFSLLKYL RQEYLLIVAT SSSETALPRL IAKMEHLGVS
KPVVGIVVPT GYSFNLDGTA MYLTMASLFI GNAMGTPLTW SEQLSLLLFM IIASKGAAGV
TGAGLATLGG GLASHRPDLV PGVGLIVAVD RFMSEARALT NFSGNAVATL VIAHWTKEVD
YSQTKRVFTG EDPFDDADML DEHSAAEHRA DVESYKTKEL AH
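
Both sequences above are printed in blocks of 10 residues separated by spaces, so they need to be rejoined before any analysis. The following is a minimal Python sketch of that cleanup, plus a GC-fraction calculation of the kind behind the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGACCA GCAACACCCC GCCGACAAGC ACTGGTGTTC CCGTTCCCGA CAAGCCCAAG "
seq = clean_sequence(first_line)
print(len(seq))             # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))
```

Applying `clean_sequence` to the full gene sequence and comparing the result against the listed gene length is a quick way to confirm that nothing was lost when copying the record.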