Gene Namu_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4204 
Symbol 
ID8449830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4646878 
End bp4648152 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content64% 
IMG OID645043253 
ProductABC transporter related 
Protein accessionYP_003203482 
Protein GI258654326 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.257703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAACG AGATCGCGGT CAGCGTGGAA CAGGTGTCCA AGCGCTTCCG GATGTTCACC 
GAGCGCAACC AGTCGCTGAA GGCCGCGCTC ATGCGCGGGA AGCGAGCGAC GTTCAACGAG
TTCTGGGCCC TGCGGGACGT GACATTCGAC ATTCCGGAAG GTTCGACCTT CGGTCTGATC
GGAGAGAACG GATCAGGCAA GTCCACGCTG CTTAAATGCA TTGCGAAGAT CTTGCGGCCG
GACACGGGCC GGGTGATCAG TCGCGGCCGG GTGGCCGCCC TGCTCGAACT CGGATCGGGT
TTCCATCCAG AATTGTCGGG CCGGGAAAAT GTGTATCTGA ACGGATCGAT CCTGGGGCTG
TCGCGCAAGG AACTCGATGC GAAATTCGAC GAGATCGTTG ATTTTTCCGG CATCGAGAAA
TTCATCGACC AGCCGGTCAA GAATTACTCG TCCGGGATGT ATGTCCGGTT GGGTTTCGCG
ATCGCGATCA ATATCGACCC GGACATCCTG CTCGTCGACG AGATCCTGGC GGTGGGTGAC
GCCGCGTTCC AGGAAAAATG CATGGAGAAA TTCGCCGACT TCCGGCGGGC GGGCAAGACG
GTGATCGTGG TCAGCCATGC CATGGGGTCG ATGCGCACGA TGTGTGACTA TGCGGCCTGG
CTGAAGCAGG GTCAGCTGGT CGAGGTCGGC ACCACCGAGC AGACCGTCGA TGATTACGTG
GACGCGACCC ACGCCCAGCG CGATCCCGGG TCCGGCTCGG GCCAACAGGC CCGCTGGGGC
TCGGGTGAGG CGACCCTCAC GGCGGTCGAG TTGATCAACG CCGCCGGAGT GCCGACGGCC
AGTTTCCGGA CGGGCGATCG CATGACCATC CGGCTGAATT ATCACTTCGC CGAGCGCATC
GAGCAGCCGG TGTTCGCGCT GAATGTGAAT GCGCTTGACG GTTACCACTT GTGGGGCTTT
CACAGCCGGG ACGGCGGTCG CGTCCCGGAG GCGCTGGAGG GCGCCGGCAG CATCGAACTT
GAGATCCCGC AGCTCATGCT GCAGCCGGGC ACGTTTGACC TGACCGCGTC GATCGTGGAC
TACTCCACGA CGCACGTCTT TGACTTCCTG CGCAACTGCG CGCGATTCGA CGTCGAGGTC
GGCACGCCGA GGGAATCCGG CGGCCCCATC GCGTTGGGCG GGCGGTGGGG TCAGCCGTTG
GCGGGCGCCG CGCCGGGGGC AGACGGCCGC GATGTCGCCG GGCTGGCCGG GACCGGGCCG
GAGGGAAGCC GGTGA
 
Protein sequence
MSNEIAVSVE QVSKRFRMFT ERNQSLKAAL MRGKRATFNE FWALRDVTFD IPEGSTFGLI 
GENGSGKSTL LKCIAKILRP DTGRVISRGR VAALLELGSG FHPELSGREN VYLNGSILGL
SRKELDAKFD EIVDFSGIEK FIDQPVKNYS SGMYVRLGFA IAINIDPDIL LVDEILAVGD
AAFQEKCMEK FADFRRAGKT VIVVSHAMGS MRTMCDYAAW LKQGQLVEVG TTEQTVDDYV
DATHAQRDPG SGSGQQARWG SGEATLTAVE LINAAGVPTA SFRTGDRMTI RLNYHFAERI
EQPVFALNVN ALDGYHLWGF HSRDGGRVPE ALEGAGSIEL EIPQLMLQPG TFDLTASIVD
YSTTHVFDFL RNCARFDVEV GTPRESGGPI ALGGRWGQPL AGAAPGADGR DVAGLAGTGP
EGSR