Gene Ndas_2654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2654 
Symbol 
ID9246505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3161870 
End bp3163018 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content73% 
IMG OID 
Producttransport system permease protein 
Protein accessionYP_003680577 
Protein GI297561603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00408489 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.22547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGAC TCACCGACCC TTCGCGGACC GGTCCGCCCG ACGCGGACGC TCTCCCCGAC 
GCCGACCCGC TGGCGGCGCG GGCGGCGGCG GGCGGCGCCG AGGTCCCTCC CGAACGCACA
CCGGGCAGGC TGCGCTCGTC CCTCGTCGTC GCCGCCTTCG TCCTCGCGCT GTGCGTGACG
ATCGTCGTCG CGGCCTTCGT CGGCACGGCG AACATCGGGG CGCTCGACGT GCTCGGGATC
ATCCTGCGCA ACATCGGGCT GGGCGCCCTC GCCCCCGTCC CGGCGGCGCC GCCGCTCATC
GACGCCCTCA TCTGGGAGTC GCGTCTGCCG CGCGTGCTCC TGGCCGCCGT GGTCGGTCTC
GGGCTCTCGG TCTCGGGCGC GGTGCTGCAG TCGATCACCC GCAACCCGCT CGCCGAACCG
TACCTGCTCG GGGTCTCCTC GGGGGCGTCC ACCGGGGCGG TCGCGATCAT GGTGCTCGGG
CTCGGCTCGG GCGCGGTGAC CCTCTCCACG GGCGCCTTCG CCGGGGCGCT GGCCGCCTTC
GCGATCGTGC TGGTCCTCAT CGGGGGCGGA CGCGTCTCCA ACCCCGCCCG CGTGGTGCTC
ACCGGTGTGC TGGTGTCGCA GTTCTTCTCC GCGATCACCT CGCTCGTGCT GATGCTCGAC
GGTGACGCGG ACGCCACCCG GGGCTTCACG TACTGGCTGC TCGGCTCGCT CGGAGGCGCA
CGCTGGGAGC CGCTGCTCGT GGCGTCTGCC GTCATCGTGC TCGGCGCCGT CGGCTGCCTG
TTCTTCGCCC CGGCCCTGGA CGCGTTCACC TTCGGCTGGG ACACCGCCTC CTCGCTCGGG
ATCAACGTGA CCCTGGCACG GGTGACGCTC ATGGTCCTCA CCGCGCTCGT CACGGCGGCG
GCCGTCGCGG CCTCCGGGGC GATCGGGTTC ATCGGGCTGC TCGTACCGCA CGTCGTGCGT
CTGCTGGCGG GGCCCGCGCA CCGCCTGCTG CTCCCCCTCA GCGGGCTCGG GGGCGCGATT
TTCCTGGTGT GGGTCGACAC CTTCGCCCGC TCGGCGTTCT CGCCGCACGA GATCCCGGTG
GGAGTGATCA CGGCGCTGCT CGGCGCACCG GTGTTCGCGG TCGTCCTGGG AAGGGCGGCC
CGGCAATGA
 
Protein sequence
MTRLTDPSRT GPPDADALPD ADPLAARAAA GGAEVPPERT PGRLRSSLVV AAFVLALCVT 
IVVAAFVGTA NIGALDVLGI ILRNIGLGAL APVPAAPPLI DALIWESRLP RVLLAAVVGL
GLSVSGAVLQ SITRNPLAEP YLLGVSSGAS TGAVAIMVLG LGSGAVTLST GAFAGALAAF
AIVLVLIGGG RVSNPARVVL TGVLVSQFFS AITSLVLMLD GDADATRGFT YWLLGSLGGA
RWEPLLVASA VIVLGAVGCL FFAPALDAFT FGWDTASSLG INVTLARVTL MVLTALVTAA
AVAASGAIGF IGLLVPHVVR LLAGPAHRLL LPLSGLGGAI FLVWVDTFAR SAFSPHEIPV
GVITALLGAP VFAVVLGRAA RQ