Gene Ndas_1498 details


Gene Information

Locus tag: Ndas_1498
Symbol:
ID: 9245348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1836568
End bp: 1837671
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidylprolyl isomerase FKBP-type
Protein accession: YP_003679434
Protein GI: 297560460
COG category:
COG ID:
TIGRFAM ID:
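
The length fields above agree with the coordinates if the start and end positions are inclusive and the CDS ends in a single untranslated stop codon. Both are assumptions; the record does not state its coordinate convention. A minimal sketch of that check, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1836568, 1837671)
print(length, protein_length(length))  # 1104 367
```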


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.169435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000876054
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACCGAC GTGCTGCCGC CCTCGCGGTG CCGCTGACCG CGATCGCCCT GGCGGTCTCA 
AGCTGCGGAA ACATCCCCGA AGAGTGGCGT ACTCCCGCCT TCATGCGTAT GGGGGAGGAC
CAGCTCGACC CGCGGCTTCC CGAAGTCACC GGCGAGGTCG GTGAGGAACC GGAGGTCGCC
TTCCCCGACG AGGAGCCCCC CACCGAGCAG ATCGCGGGCG TCGTCGACGA GGGCCCGGGC
GAGAACGAGC TGGTGCGCGC CGACGACCTC CTGATCGCCA ACGTCGTCCA GTTCCAGTGG
ACCGGCCCCG GCGAGGGCGC GCCCGTCGAG GGGCAGTCCA GCTACGAGAC CGGCGCCCCG
GACCTGATCC GCATGGAGCA GATGCCCGCG GAGATCAGCG ACGTGCTGGT CAGCCAGCCG
GTCGGCAGCC GGGCCGTGTA CGTCTTCCCG CCCCTGACCG AGCAGGAGCG CCAGCAGGCC
GAGATGTCGG GACAGCCCGT CCAGGAGGGC GCGAGCGTCC TGGTCATCGA CCTGATGGAC
CGCTTCAACA AGGGTTCGGT CGTGGAGGGC CAGCAGGTCA CCGACGGCGG CGACGGCCTG
CCCACGGTGA CCCAGGAGGG CCACAGCGAG CCCACCATCG AGGTCCCCGA CACCGATCCC
CCCGAGAACC TGGAGGTCGT CCCGCTCATC GAGGGCGACG GCGCCGAGGT CGAGGAGGGC
CAGCAGGTCA TCGTCCAGTA CAGCGGTGTG CGCTGGGAGG CCGACGACAA CGGCGAACAC
CCGGTGTTCG ACTCCACCTG GAGCCGCGGC GGCGACCCCT TCGACACCAC GATCGGCGCG
GGCGCGGTCA TCGAGGGCTG GGACGAGGGC ATCGTCGGCC AGCCGGTCGG CAGCCGCCTG
ATGCTGGTCG TGCCCGGCGA CATGGCCTAC GGCGAGACCG AGGAGGAGTC CGGGGGAGCC
CCCGCCGGGA CGCTGGTCTT CGTCATCGAC ATCCTGGGCG CCTACGACAA CCCCCCGGCC
CCCGAGCCCG CAGAGGGCGA GGGCGCCGGC GGCGAGGAGG CCGCACCGGA GGAGTCCCCC
GCGCCCGAGG AGGGCGGGGA GTAG
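
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the reported GC content. The function names are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used here as a short example.
first_line = "ATGCACCGAC GTGCTGCCGC CCTCGCGGTG CCGCTGACCG CGATCGCCCT GGCGGTCTCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1104 bp sequence through the same two functions gives a value to compare against the GC content field.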
 
Protein sequence
MHRRAAALAV PLTAIALAVS SCGNIPEEWR TPAFMRMGED QLDPRLPEVT GEVGEEPEVA 
FPDEEPPTEQ IAGVVDEGPG ENELVRADDL LIANVVQFQW TGPGEGAPVE GQSSYETGAP
DLIRMEQMPA EISDVLVSQP VGSRAVYVFP PLTEQERQQA EMSGQPVQEG ASVLVIDLMD
RFNKGSVVEG QQVTDGGDGL PTVTQEGHSE PTIEVPDTDP PENLEVVPLI EGDGAEVEEG
QQVIVQYSGV RWEADDNGEH PVFDSTWSRG GDPFDTTIGA GAVIEGWDEG IVGQPVGSRL
MLVVPGDMAY GETEEESGGA PAGTLVFVID ILGAYDNPPA PEPAEGEGAG GEEAAPEESP
APEEGGE
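
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own method. It uses the standard amino-acid assignments, which NCBI translation table 11 shares (table 11 differs only in its set of allowed start codons), and it assumes the CDS starts with ATG, as this one does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG codon order. Translation table 11 uses
# the same assignments as the standard code; only start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (display spacing allowed) up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence.
print(translate("ATGCACCGAC GTGCTGCCGC CCTCGCGGTG"))  # MHRRAAALAV
print(translate("GCGCCCGAGG AGGGCGGGGA GTAG"))        # APEEGGE
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.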