Gene Ndas_2103 details


Gene Information

Locus tag: Ndas_2103
Symbol:
ID: 9245953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2522791
End bp: 2523852
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: biotin synthase
Protein accession: YP_003680034
Protein GI: 297561060
COG category:
COG ID:
TIGRFAM ID:
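
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 2522791
end_bp = 2523852

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1062 0 353
```

Both results agree with the Gene Length and Protein Length fields of the record.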


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.881041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACAT TCCTTCCCCT GGCCGAGAAG GCCCTTCGCC GCGAGACGCC CACCCGCGCC 
GAACTCCTCG GAGTTCTGAG CAGCTCCGAC GACGACCTGA TGGACCTGGT CGCCGCGGCC
TTCCGCGTCC GACGGCGTTA CTTCGAGCGC CGCGTCAAGC TCAACTACCT CGTGAACGTC
AAGAGCGGAC TGTGCCCGGA GGACTGCTCC TACTGCTCCC AGCGCCTGGG CTCCGACGCC
GGAATCGTCA GGTACACCTG GCTCAGGCCC GACCAGACCC GCGAGGCCGC CTCCCACGGG
GTGAGGGCGG GCGCCACCCG GGTCTGCCTG GTGGCCAGCG GGCGCGGCCC CACCGACCGC
GACGTGGAAC GGCTGGCCCC CGCCATCGAG GGCATCAAGA GCGAGCACCC CGGCGTGGAG
GTGTGCGCCT GCCTGGGTCT GCTCTCCGAG GGCCAGGCCG ACCGGCTGCG CCGGGCCGGG
GCCGACGCCT ACAACCACAA CCTCAACACC TCGGAGGAGC GCTACGCCGA CATCTGCACC
ACCCACGCGT TCGCCGACCG GGTGGACACC GTGACCCGGG CGAAGCAAGC GGGCCTGTCC
CCCTGCTCCG GGCTGATCGC CGGGATGCGG GAGAGCGACG ACGACCTGGT CGACGCCCTG
TTCGCGCTGC GCGAACTGGA ACCGGACTCG GTGCCGGTCA ACTTCCTCAT GCCCTTCGAG
GGCACCCCGC TGGAGGGCAC CTGGGAGCTG ACCCCGCAGC GGTGCCTGCG CGTCCTGGCC
GCCGCCCGGT TCGTCTTCCC CGACGTGGAG GTGCGCCTGG CGGGCGGCCG CGAGATCCAC
CTGCGCTCCC TCCAGCCGCT GGCGCTGCAC ATCGCCAACT CGGTGTTCCT GGGCGACTAC
CTCACCAGCG AGGGACAGGC GGGCCGGGCC GACCTGGAGA TGATCGAGGA CGCCGGTTTC
ACCGTGCAGG GCGCGGGCGA GCCCACCCTG CCCAGGGAGC GCCACGACCT GCTCAGGCCG
CGCCGGAGGG GCGCCGGAAC CGCTCTTCCG CCCAACGCCT GA
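
A GC content figure such as the one in the gene information can be recomputed from the sequence text. The following is a minimal sketch, not the method used by the database: the helper name `gc_fraction` is hypothetical, and only the first 60-base row of the gene sequence is used as input, so the printed value describes that row rather than the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, with the 10-base grouping left in place.
first_row = "GTGACCACAT TCCTTCCCCT GGCCGAGAAG GCCCTTCGCC GCGAGACGCC CACCCGCGCC"
print(f"{gc_fraction(first_row):.1%}")
```

Passing all rows of the gene sequence joined together would give the value for the full gene.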
 
Protein sequence
MTTFLPLAEK ALRRETPTRA ELLGVLSSSD DDLMDLVAAA FRVRRRYFER RVKLNYLVNV 
KSGLCPEDCS YCSQRLGSDA GIVRYTWLRP DQTREAASHG VRAGATRVCL VASGRGPTDR
DVERLAPAIE GIKSEHPGVE VCACLGLLSE GQADRLRRAG ADAYNHNLNT SEERYADICT
THAFADRVDT VTRAKQAGLS PCSGLIAGMR ESDDDLVDAL FALRELEPDS VPVNFLMPFE
GTPLEGTWEL TPQRCLRVLA AARFVFPDVE VRLAGGREIH LRSLQPLALH IANSVFLGDY
LTSEGQAGRA DLEMIEDAGF TVQGAGEPTL PRERHDLLRP RRRGAGTALP PNA