Gene Ndas_2641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2641 
Symbol 
ID9246492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3150277 
End bp3151503 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content71% 
IMG OID 
Productprotein of unknown function UPF0118 
Protein accessionYP_003680564 
Protein GI297561590 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.517547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.208528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACACGA GCACCACACC GCCCGGGTCC GCGCGCCCCG ACCGCGGCGG CGGCATGCCG 
CGCTGGCTCC CCCGGGCCAT GCTGCTGGCC CTGTGGCTGG TCACCGCGTT CGGCCTCACC
CTGTGGCTGT TCGTCCGGTT GCAGAGCCTC ATCATGCTGC TGCTGATCTC GCTGTTCCTC
GCCCTGGCGC TGGAACCGGC GGTCAACTGG CTCCACCGGC ACCGCTGGCC GCGCGGGCCC
GCCACCGGGC TGGTGATGCT GCTGGTACTG GCGCTGACCG TGGTGTTCCT CAGTCTGCTC
GGGTCGATGC TGGTCGGCCA GATCCTGGCC TTCGTCTCCG AGATCCCCGC GATGATCCGC
GCCGCGCTGG CCTGGGTCAA CACCACGTTC GACACCTCCT ACTCCCCCAC CACCCTGCTC
AACGAGATCT CCAGCGCCAG CGGGCTGATC GAGCAGTACG CCTCCGGTAT CGCCAACAAC
GTCTGGGGCG CCGGGACGAC CGTCCTGGCG CTGCTGTTCA ACGCGCTGAC GATCGCGCTC
TTCACCTTCT ACCTGTGCGC CGACGGCCCG CGCTTCCGCC GCGTGATCTG CTCGGTCCTG
CCGCCGCGCA CCCAGCGCGA GGTGCTGCGG GCCTGGGAGA TCGCGATCAC CAAGACGGGC
GGTTACCTCT ACTCCCGTGC GCTGCTGGCC CTGGTCTGCT CGGGCGCGCA CTACGTGGTG
CTGGTCGCGC TGGACATCCC CTTCGCGTTC GCCCTGGCGC TGTGGGTGGG CGTGCTGTCG
CAGTTCATCC CCACCGTGGG CACCTACATC GGCGGGGTCG TCCCGGTGCT CGTGGCGCTG
ATGGAGGGCA TCTGGCCCGC CGTGTGGGTG CTGGTGTTCA TCGTCGTCTA CCAGCAGTTC
GAGAACTACC TGCTCCAGCC GCGCATCACC GCCAGGACCC TGGACATGCA CCCGGCGGTG
GCGTTCGGCT CGGTCCTGGC GGGCGTGGCC ATCCTGGGGG CGCCCGGCGC GCTGCTCGCG
CTGCCGATGG GCGCGAGCAT GCAGGCGTTC CTGGGGACCT ACATCCGGCG CTACGAGGTG
GCCGAGCACC CCCTGCTCTC CGACGCCGAG GAGGACGGGA AGGGCGGGAA ACCCTCCCCG
GATCCGGTGG CCCCGGTCTC CGAGGGCGAC GGGGGCGACG CGCGTCCCCC CGGTCCGCGG
GAGCGGGAGG GCGGGGAGGG ACCGTGA
 
Protein sequence
MHTSTTPPGS ARPDRGGGMP RWLPRAMLLA LWLVTAFGLT LWLFVRLQSL IMLLLISLFL 
ALALEPAVNW LHRHRWPRGP ATGLVMLLVL ALTVVFLSLL GSMLVGQILA FVSEIPAMIR
AALAWVNTTF DTSYSPTTLL NEISSASGLI EQYASGIANN VWGAGTTVLA LLFNALTIAL
FTFYLCADGP RFRRVICSVL PPRTQREVLR AWEIAITKTG GYLYSRALLA LVCSGAHYVV
LVALDIPFAF ALALWVGVLS QFIPTVGTYI GGVVPVLVAL MEGIWPAVWV LVFIVVYQQF
ENYLLQPRIT ARTLDMHPAV AFGSVLAGVA ILGAPGALLA LPMGASMQAF LGTYIRRYEV
AEHPLLSDAE EDGKGGKPSP DPVAPVSEGD GGDARPPGPR EREGGEGP