Gene Ndas_3464 details


Gene Information

Locus tag: Ndas_3464
Symbol:
ID: 9247333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4152904
End bp: 4154037
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: integrase family protein
Protein accession: YP_003681371
Protein GI: 297562397
COG category:
COG ID:
TIGRFAM ID:
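
The reported gene and protein lengths can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (which matches the 1134 bp figure in this record) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the record above.
length = gene_length_bp(4152904, 4154037)
print(length)                     # 1134
print(protein_length_aa(length))  # 377
```

Both values agree with the Gene Length and Protein Length fields listed in the record.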


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTAGGC GAGTAGCGAT GGCGGTGGAC GTGCGCACCG AACCACCCAC CCTCGCCGCA 
TGGCTGGGGG AGTGGTTGGC GGGCAGGCCC GATCTGGCCG CGGGTACCCG GGCCTCCTAC
GCCAGGCACA TCCGCACCTA CCTTTCCCCG CACCTGGGCC GGCTCCGGGT GGACAAGCTC
CAGGCCCGCC ACGTCGAGAC CATGTTCAAC CTCATTGAGG AGACCAACGC TCACATCCTG
GAGTGCCGCG AGTCCGACGA TGTGAAGGTG CGGGCCTCGG TCAGGGGGCG CCGGGTGATC
TCGCTGTCCA CCAAGCACCG GATCCGCGAG ACTCTCCGGA GTGCGTTGTC CGAAGCCGTC
AGGCGTCCGG ACTTGCCGGT GTCGGTGAAC ATCGCCTCAC ACGTGCGGTT GCCCTCGTGT
CCGAGGAAGC GTCCGTTGGT GTGGACTTCG GACCGGGTGC GGCAGTGGAA GAAGGACGGG
ACCGTGCCGG GGGAGGTCAT GGTGTGGGCC CCGGAGCAGA CCCGCGCTTT CCTGGTGCAC
GCGAGGAGGT ATCCGTGGCT GTACCCGATG TTCCACCTGG TCGCGGTCAA GGGCCTGCGG
CGAGGAGAAG CGGTCGGCCT GCCCTGGTCC AACACGAGGT TGACCGACGG GCAGATCGAC
ATCCGCGTCC AGGTCGTACA ACTCGCGTGG GAGACGATCA CCTCCACCCC GAAAAGCGCG
GCCGGGCAGC GCACCATCAC ATTGGACACC GACACCATCA AAGTGCTGCG GGCCTGGAAG
CGGTTCCAGA ACGAAGCCCG CCTCAAAGCC GGAACAGCGT GGACGGACAC CGGTTTGGCC
TTCACCAAGG AGAACGGCGC AGGATGGCAC CCAGGTCAGG TCAGTGACTG GTTCCTGCGC
ATCGCCAGGG CAGCGGGGCT GCCGCCGATC ACGTTGCACG GGCTCCGCCA CGGAGCCGCC
TCGCTGGCGT TGGCCGCGGG AACGGACGTG AAGATCGTCT CCAGCGAGCT CGGGCACGCG
ACCACGCACT TCACCCAGGA CACCTACCAG TCGGTGTTCC CCGACGTGGC CAAAGCGGCG
GCCGAGGCCA CCGCCGCGCT GCTACGAGGC CCGTCTCCGG TGAGGGCCGG GTAG
 
Protein sequence
MRRRVAMAVD VRTEPPTLAA WLGEWLAGRP DLAAGTRASY ARHIRTYLSP HLGRLRVDKL 
QARHVETMFN LIEETNAHIL ECRESDDVKV RASVRGRRVI SLSTKHRIRE TLRSALSEAV
RRPDLPVSVN IASHVRLPSC PRKRPLVWTS DRVRQWKKDG TVPGEVMVWA PEQTRAFLVH
ARRYPWLYPM FHLVAVKGLR RGEAVGLPWS NTRLTDGQID IRVQVVQLAW ETITSTPKSA
AGQRTITLDT DTIKVLRAWK RFQNEARLKA GTAWTDTGLA FTKENGAGWH PGQVSDWFLR
IARAAGLPPI TLHGLRHGAA SLALAAGTDV KIVSSELGHA TTHFTQDTYQ SVFPDVAKAA
AEATAALLRG PSPVRAG
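
The sequences above are printed in 10-character blocks separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("GTGCGTAGGC GAGTAGCGAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.6
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content listed in the record.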