Gene Namu_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1664 
Symbol 
ID8447263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1828886 
End bp1829986 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content74% 
IMG OID645040787 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003201043 
Protein GI258651887 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0000133453 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0424658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCT TGACCCACCG CCCACGAGGA CGCCGGCTGC TGCAGGCGGC CGGCGCGATC 
GGCCTGGCCG CCCTCCTGGC CGGTTGCGCC TCCAGCTCGA CCGGGGCGGC CGGATCGGGC
ACGGCCAGCT CCACCGCCGG GTCGGCGTCC GCCGCACCGA GCGAGACCGG CGGCACGCTG
GTGGTCTACT CCGGCCGGAA CAAGGACCTG ATCGGCCCGC TGTTGGATCA ATTCAGCGCG
CAGACCGGGG TCGCGGTCGA GTTCCGCGCC GGTGATTCCG GCGAGCTGGC CGCCCAGCTG
CTGACCGAGG GCGACGCCTC CCCGGCGGAT GTCTTCTTCT CCCAGGACGC CGGCGCCCTG
GGCGCGGTCG CGCAGGCCGG CCTGTTCACC TCCCTGCCGG CGGCGACGGT CGAGGCGGTT
CCCGCGGCCT ACGCCGCGAC CGACGGCAGC TGGGTGGGCG TGTCCGGGCG GGCCCGGGTG
ATCGTCTACA ACCCGACGCT GGCGCCGAAC CCGCCGGACA CCATCGACGG GTTGCTGGAC
CCGCAGTGGA AGGGCCAGAT CGGGTTCGCC CCAACCAACG CGTCCTGGCA GGCGTTCGTC
ACCGGGCTGC GGGTGCTGCG CGGGGAGGAC GGGGCCCGGC AGTGGCTGAC CGCGTTCGCC
GCCCAGGAGC CCAAGGCCTA CGAGCGCAAC GGGGCGGTGC GCGACGCGGT GAACAGCGGC
GAGATCGCCC TGGGCCTGGT CAACCACTAC TACCTGTACG AGAAGATCGC CGCCGACGGG
GCGGACGCCG TGGTCGCCCA GAACCAGTAC CTGGCCGCGG GCGACCCGGG TGGGCTGCTG
AACGTGGCCG GGGTCGGCGT CCTGGCCTCC GCGCCGCACG CCGAGCAGGC CCAGGTCTTC
GTGGACTACC TGCTCTCGCC GGCCGGCCAG GAGTACTTCG CAGCCAAGAC CAAGGAGTTC
CCGCTGGTGC CGGGCACCGC CGCGGCCGCC GAGGTGCCGC CGCTGAGCGA GCTGAGCCCG
CCGCAGATCG ACCTGTCGCA GCTCAGCTCG CTGGAGCAGA CCCAGCAGCT GCTGTCCGAG
GTGGGGTTGC TGACCCGGTG A
 
Protein sequence
MRVLTHRPRG RRLLQAAGAI GLAALLAGCA SSSTGAAGSG TASSTAGSAS AAPSETGGTL 
VVYSGRNKDL IGPLLDQFSA QTGVAVEFRA GDSGELAAQL LTEGDASPAD VFFSQDAGAL
GAVAQAGLFT SLPAATVEAV PAAYAATDGS WVGVSGRARV IVYNPTLAPN PPDTIDGLLD
PQWKGQIGFA PTNASWQAFV TGLRVLRGED GARQWLTAFA AQEPKAYERN GAVRDAVNSG
EIALGLVNHY YLYEKIAADG ADAVVAQNQY LAAGDPGGLL NVAGVGVLAS APHAEQAQVF
VDYLLSPAGQ EYFAAKTKEF PLVPGTAAAA EVPPLSELSP PQIDLSQLSS LEQTQQLLSE
VGLLTR