Gene Ndas_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2089 
Symbol 
ID9245939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2510218 
End bp2511447 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content71% 
IMG OID 
Productmembrane protein 
Protein accessionYP_003680021 
Protein GI297561047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGTC GGGACAGGGC GGGCGCGGAC GGGGCCGTGG CCCGCGCACG CGCGTGGGTG 
AGCCGTGCGG TGGGTTCGGA CGGGCACGAA CGGCACACGG CGCTGTCCAT CGGCAAGAGC
GCCCTGGCCG CCTCGTTGGC GTGGTTCGTC GCACAGCACC TGATCCAGGC CGCCTCACCG
GCCTTCGCGC CCTTCACCGC GGTCCTGATG ATGCAGGTCA CCGTCTACCA GTCGGTGGTC
CAGTCGCTGC GCTACGTGGG CGCGGTGGCG CTGGGGATCA CCGTGCAGGC CGTCCTGGGC
TTCGCCGCGG GCCCGGACAT GCTGACCTTC GTCGTGGTCA CGCTCATCGC CCTGGTCATC
GGCCGGTGGA GGCCCCTGGG CAGCCAGGGG TCCCAGGTGG TCACGGCCGC GTTCTTCGCC
TTCTCCACCT TCCTGTCCTC CCAGGGCTAC ACCGGGCGCC TGGTGGACCT GGCCCAGATC
CTGTTGCTGG TGCTCATCGG CTGCGCCATC GGCACCGCGG TCAACGTCCT CGTGCTGCCG
CCCATGCGCT TTCGCAGCGC GGAGTACGCC ATCCGCTCCC TCGCGCACGC CGAGTGCGAC
CTGATCGGGG ACATCCGGCG CGGAATGGAA CGGGGGGAGC TGACCGAGGA CGAGGCCGAG
GACTGGCGCC AGCGGGCCAA CCGGCTCGCG TCGACCGTCC GGCAGACCCG TTCGGCGGTG
GACACCGCCT GGGAGAGCGT CTACCTCAAC CCGGCCAGGC TCCTGCGCAG ACACCGCCAC
CACGTCGCCT TCGAGGGCTA CCAGCAGTTG GTCGACGCCC TGGAGCGCAC CACCCACCAG
CTGGGCTCCC TGGCCCGCAG CATCCACCTG TGGAGCCGTG ACGGGGAGCG CTCCGTCCAC
CAGGACTTCC TGAGCCTCTA CGGGGACTTC CTCGCCTCGA TCGCCGAGAT CACCCGCGAG
CTCAGCGGGT TGGACCAGGA CCGGCTCGGC CCCCAGGCCA GGAGGCTGTG CCGGCTCGCG
GAGCAGGCCC GGGAGCGCTA CGACGACCTC GTGCGGGCCA AGGAGGGCGA GGACGCCCCG
CCCTTCGACG ACCTCCGGCT GCCCTACGGC GTGCTCCTGA TCGAGGCGCA GCGGCTCATG
GACGAGTTCC AGTACAGCTG TGACGTGATC CTGCACTACG TGGACCGCCC CGGTCCCGCG
GACGGGTCCG GCGGCGCCCG CCACATCTGA
 
Protein sequence
MTGRDRAGAD GAVARARAWV SRAVGSDGHE RHTALSIGKS ALAASLAWFV AQHLIQAASP 
AFAPFTAVLM MQVTVYQSVV QSLRYVGAVA LGITVQAVLG FAAGPDMLTF VVVTLIALVI
GRWRPLGSQG SQVVTAAFFA FSTFLSSQGY TGRLVDLAQI LLLVLIGCAI GTAVNVLVLP
PMRFRSAEYA IRSLAHAECD LIGDIRRGME RGELTEDEAE DWRQRANRLA STVRQTRSAV
DTAWESVYLN PARLLRRHRH HVAFEGYQQL VDALERTTHQ LGSLARSIHL WSRDGERSVH
QDFLSLYGDF LASIAEITRE LSGLDQDRLG PQARRLCRLA EQARERYDDL VRAKEGEDAP
PFDDLRLPYG VLLIEAQRLM DEFQYSCDVI LHYVDRPGPA DGSGGARHI