Gene Ndas_0004 details


Gene Information

Locus tag: Ndas_0004
Symbol:
ID: 9243830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5627
End bp: 6760
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: DNA replication and repair protein RecF
Protein accession: YP_003677963
Protein GI: 297558989
COG category:
COG ID:
TIGRFAM ID:
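
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are chosen here for illustration.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 5627
end_bp = 6760

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the terminal stop codon (assumption).
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1134 bp
print(protein_length, "aa")  # 377 aa
```

Both values agree with the Gene Length and Protein Length fields listed above.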


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.780515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00205006
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTACGTAT CCCACCTGCA ACTGGCCGAC TTCCGGTCCT ACCGCGAGGC CCTCGTGGAG 
ATGGGCCCCG GCGTGAGCGT GTTCGTCGGC GCCAACGGCC AGGGCAAGAC CAATCTGGTC
GAGGCGATCG GCTACGTGGC CACCCTCGGC AGCCACCGGG TCTCCTCCGA CACCCCGCTG
GTCCGCCAGG GCGCCCCCCG CGCGATCGTC CGGGCCAAGG TGGTGCGCGA CGAGCGGTCC
ATGGTCGTGG ACCTGGAGCT CAACCCCGGC AGGGCCAACC GGGCCCGGAT CAACCAGGCG
CCCGCGGGCC GTCCGCGCGA GGTCCTGGGG ATCCTGCGCA CCGTGCTCTT CGCCCCGGAG
GACCTGGCCC TGGTCAAGGG CGACCCCGGC GAGCGGCGCC GGTTCCTGGA CGACCTGCTG
GTGGCGCGCG CGCCCCGGAT GGCGGGCGTG CGCTCGGACT ACGACCGGGT GCTCAAGCAG
CGCAACGCCC TGCTCAAGTC GGCCTCCGGC CGGATGTTCC GCCAGCGCTC GGCGCCCGAC
CTGAGCACGC TGGAGGTGTG GGACTCCCAC CTGGCGGAGA CGGGCGCGGA GCTGCTGGCG
GCGCGGCTGG AGCTGGTGGA GGAGCTGCGC CCGAGGATCG CCGAGGCCTA CGCCGGGCTG
ACCGACTCCG GGGGCCCGGC CGTCCCCGAC TACCGCAGCG GCGCGGTCCC CGAGGGGGTC
GAACCGCCGA CCGGCCGTCC ACAGCTTGTG GAAACCCTGC GCGCGGCCAT GGCCGAGGCC
CGCGACCGCG AGCTCCAGCG CGGCGTCAGC CTGGTGGGCC CGCACCGCGA CGATCTGGTC
CTGCGACTGG GCGGGATGCC CGCCAAGGGC TACGCCAGCC AGGGCGAGTC CTGGTCGTAC
GCCCTCTCGC TCAAGCTGGC CGCCTTCGAC CTGCTGCGCT CCGACGGAGA CGACCCGGTG
CTGATCCTGG ACGACGTGTT CGCCGAGCTG GACAGCGAGC GCCGCCGCAG GCTGGCCGAG
CGCGTCGGCG ACGCCGAACA GGTCCTGGTG ACCGCGGCCG TGCCCGAGGA CATCCCCAAG
GAGCTGGACG GGGCCCGGTT CGGCGTGCGC GAGGGGGGCG TCGCGGGTGA GTGA
 
Protein sequence
MYVSHLQLAD FRSYREALVE MGPGVSVFVG ANGQGKTNLV EAIGYVATLG SHRVSSDTPL 
VRQGAPRAIV RAKVVRDERS MVVDLELNPG RANRARINQA PAGRPREVLG ILRTVLFAPE
DLALVKGDPG ERRRFLDDLL VARAPRMAGV RSDYDRVLKQ RNALLKSASG RMFRQRSAPD
LSTLEVWDSH LAETGAELLA ARLELVEELR PRIAEAYAGL TDSGGPAVPD YRSGAVPEGV
EPPTGRPQLV ETLRAAMAEA RDRELQRGVS LVGPHRDDLV LRLGGMPAKG YASQGESWSY
ALSLKLAAFD LLRSDGDDPV LILDDVFAEL DSERRRRLAE RVGDAEQVLV TAAVPEDIPK
ELDGARFGVR EGGVAGE
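
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only and not part of the original record. It builds the codon table from the standard NCBI base ordering (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The helper name `translate` and the variable `first_line` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "ATGTACGTAT CCCACCTGCA ACTGGCCGAC TTCCGGTCCT ACCGCGAGGC CCTCGTGGAG"
print(translate(first_line))  # MYVSHLQLADFRSYREALVE
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same function would be expected to yield all 377 residues, ending at the terminal TGA stop codon.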