Gene Ndas_2914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2914 
Symbol 
ID9246766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3485533 
End bp3486831 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680830 
Protein GI297561856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0970289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGA ACGTCTTCGT CCTGGGGCTC GACGACCAGA ACCGGGAGGT CCTGCGCTCG 
CTGCCCGCCG AACGCGGCTA CCGCTTCCTC CAACTGCTCG ACCTCGACGA GATCGGCCAG
GACCGGCCGG TGGACCTGCC CGCGGTCCTG GCCCTGGCCG CAGGGAGGCT CGACTCCTTC
GGGGAGGGGG TCGACGCGGT CATCGGGTTC TGGGACTTCC CGGTCAGCTC GCTGGTGCCG
CTGCTGTGCC GGGACCGGGG GCTGTCGTGC GCGCCCCTGG AAGCCGTCGT CCGGTGCGAG
CACAAGTACT GGAGCCGTCT GGAGCAGCGC GCCGTCATCC CCGAGGCCGT TCCCCGGTTC
GCCGAGGTCG GCCTCGACAC CACGGCCAAA CCGGAGGGGC TGTCCTACCC GCTGTGGGTC
AAGCCGGTGA AGTCGTGCTC CTCCGAACTG GCCTTCCGCG CCGCCGACGA CGCCGAGTTC
CGTGCGGCGA TGGACCGGAT CAGGGCGGGC GCCGGACGCT TCGGCACGTC CTTCCAGTGG
GTGATGGACC TGCTGCGGCT GCCCCCGGAG GTGGCCGGGT GCGGGGGCAC GGCCGCGCTG
GCCGAGGAGG AGGCCACGGG CGACCAGCTC ACCGTCGAGG GCTACGTCCG TCGGGGCGTT
CCCCACGTCT ACGGGATCGT CGACTCCCAC CGCTATCCGG GCACGTCCAG TTTCCTGCGC
TACGAGTACC CGTCCAGGGT GCCCGAAGAG GTCGGCGCGC GCCTGACCGA GCTGTCCGAA
CGGGTGATCT CCCGGATCGG CCTGGACGAC GTCACCTTCA ACATCGAGTA CTTCCACGAT
CCGCGCAGCG GCGACGTCCG GCTCCTGGAG ATCAACCCGC GCCACTCCCA GTCGCACGCC
AGGCTGTTCG AACTGGTGCG CGGGGTGTCC AACCACGAGA TCTGCCTGAG GGTCGCGCTG
GGGGAGGAGC CCGAGCTGGA GCGCGGCGGG CGGTACGCGG TGGCCGCCAA GTGGTTCCTG
CGCCGCTTCC GCGACGGCTT CGTCGCGGCG GTGCCCACCG CCGAGGACGT CGCGCGTGCG
GAGAAGACGG CGCCGGGTAC GAAGGTGGAG ATCGTCGCGC GCCAGGGCCA GCGGCTCTCC
GACCTGCCCC TCCAGGACAG CTACAGCTTC GAGCTGGCCC AGGTGTTCGT GGGCGGCGGG
GACAGCGAGG AGTGCGAGCG CAGGTACCGG GCCTGCGTGG CGGAACTGCC CTTCACGGTC
CATGACACCG ACGACGTCCG ACAGGAGGGG GTCCGGTGA
 
Protein sequence
MTTNVFVLGL DDQNREVLRS LPAERGYRFL QLLDLDEIGQ DRPVDLPAVL ALAAGRLDSF 
GEGVDAVIGF WDFPVSSLVP LLCRDRGLSC APLEAVVRCE HKYWSRLEQR AVIPEAVPRF
AEVGLDTTAK PEGLSYPLWV KPVKSCSSEL AFRAADDAEF RAAMDRIRAG AGRFGTSFQW
VMDLLRLPPE VAGCGGTAAL AEEEATGDQL TVEGYVRRGV PHVYGIVDSH RYPGTSSFLR
YEYPSRVPEE VGARLTELSE RVISRIGLDD VTFNIEYFHD PRSGDVRLLE INPRHSQSHA
RLFELVRGVS NHEICLRVAL GEEPELERGG RYAVAAKWFL RRFRDGFVAA VPTAEDVARA
EKTAPGTKVE IVARQGQRLS DLPLQDSYSF ELAQVFVGGG DSEECERRYR ACVAELPFTV
HDTDDVRQEG VR