Gene Ndas_1487 details


Gene Information

Locus tag: Ndas_1487
Symbol:
ID: 9245337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1820670
End bp: 1821935
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: protein of unknown function DUF21
Protein accession: YP_003679423
Protein GI: 297560449
COG category:
COG ID:
TIGRFAM ID:
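
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither assumption is stated in the record. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 1820670
END_BP = 1821935
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1266
print(expected_protein_length(GENE_LENGTH_BP))  # 421
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.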


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.121973
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.308123
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAGCT ACGGGGTTCA ACTCGGCCTC GTCCTTGTCC TCGTTCTCGT CAACGCCCTG 
TTCGCGGGGA GCGAGATCGC CCTGATCACC CTCCGGGAGG GGCAGATCAA ACAGTTGGCG
GCGCGGGGTC CCGGCGGCCG GGCGGTGGCT CGCCTGGCAC GGGACCCCAA CCGTTTCCTG
GCCACCATCC AGATCGGCAT CACCCTCGCG GGCTTCCTGG CCTCGGCCAC CGCGGCCGTG
TCCCTGGCGC AGCCCCTCAT CGAGCCCCTG GGCTTCCTGG GCTCGGCCGC CAGCCCGGTG
GCGATCGTCC TGGTGACCGT GCTCCTGTCC TTCGTCACGC TGGTCTTCGG AGAGCTGGCG
CCCAAGCGCA TCGCCATGCA GCGGGCCGAG ACGTGGGCGG TGCTGGTCTC CCGACCGTTG
GACCTGCTCG CCATGCTCTC CCGCCCCGTG GTGTGGCTGT TGAGCGTTTC CACCAACCTC
GTGGTGCGCC TGACGGGCGG TGACCCCTCC GCGGCCAAGG AGGAGGTCAG CGAGGAGGAG
CTGCGCGACA TGCTCGCCAC CCAGCGGGGC ATGACCCGGG AGCAGCGCAC CATCATCTCC
GGAGCCTTCG AGATCGACGA CCGGCGCCTG CGCCAGGTCG TCGTTCCCCG TGGTGAGGTG
TTCACCATCC CCGCCCGCAC GCCCGCGGCC CAGGCGGCGC AGATGCTCGC CGAACACGGG
CACTCCCGGG CGCCGGTGGT CAACGACGAC GACCTGGACG ACGTGCTCGG TGTCGTGCAC
TGGTCCGACC TGGTGCGCGG TGGGGCCGAC GCCGGAGAAC TGGCCCGCGA ACCGCTGCTC
CTGCCCGATT CCCTGGTGGT GTCGTTGGCC CTGCGCCGCA TGATCGCCGA GCACCAGCAG
CTGGGCGTGG TCATCAACGA GGTCGGTGGC GTCGACGGCA TCGTGAGCCT GGAGGACCTG
CTGGAGGAGA TCGTCGGGGA GATCTACGAC GAGACCGATT CCGACATACG CACCGTGACC
CGCAACGCCG ACGGGTCCTT CACCCTGCCC GGGACCTATC CCGTGCACGA CCTGCCCGAC
ATCGACATTC ATCTGGACGA TCTGCCCGAA GGGGACTACG TCACCGTCGC CGGGCTGGTC
ATCGCGGTGC TGGGCCACAT CCCCCAGGAG CCGGGGGAAG AGGTGGTGCT GGACTCCTGG
AAGGCCAGGA TCGACCAGGC CAACGGGCGC ACCGTCACCC AGGTGACCAT GTCCCCCGCG
GCGTAA
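
The GC content field can be recomputed from the gene sequence as the fraction of G and C bases. The following is a minimal sketch, not the database's own method; `gc_content` is a hypothetical helper that strips the spacing used in the listing before counting. It is demonstrated on the first ten bases only; passing the full sequence would allow a comparison with the reported 69%.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a nucleotide string.

    Whitespace (the 10-base grouping and line breaks of the listing)
    is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence.
first_block = "ATGGAAAGCT"
print(f"{gc_content(first_block):.0%}")  # 40%
```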
 
Protein sequence
MESYGVQLGL VLVLVLVNAL FAGSEIALIT LREGQIKQLA ARGPGGRAVA RLARDPNRFL 
ATIQIGITLA GFLASATAAV SLAQPLIEPL GFLGSAASPV AIVLVTVLLS FVTLVFGELA
PKRIAMQRAE TWAVLVSRPL DLLAMLSRPV VWLLSVSTNL VVRLTGGDPS AAKEEVSEEE
LRDMLATQRG MTREQRTIIS GAFEIDDRRL RQVVVPRGEV FTIPARTPAA QAAQMLAEHG
HSRAPVVNDD DLDDVLGVVH WSDLVRGGAD AGELAREPLL LPDSLVVSLA LRRMIAEHQQ
LGVVINEVGG VDGIVSLEDL LEEIVGEIYD ETDSDIRTVT RNADGSFTLP GTYPVHDLPD
IDIHLDDLPE GDYVTVAGLV IAVLGHIPQE PGEEVVLDSW KARIDQANGR TVTQVTMSPA
A
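
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is read in frame from its first base. `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAAAGCT ACGGGGTTCA ACTCGGCCTC"))  # MESYGVQLGL
```

The result matches the first ten residues of the protein sequence; the same function applied to the full gene sequence can be compared against the complete 421-residue listing.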