Gene Ndas_1685 details

Gene Information

Locus tag: Ndas_1685
Symbol:
ID: 9245535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2057488
End bp: 2059137
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Oligopeptide transporter OPT superfamily protein
Protein accession: YP_003679620
Protein GI: 297560646
COG category:
COG ID:
TIGRFAM ID:
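
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2057488, 2059137)
print(length, protein_length(length))  # 1650 549
```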


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0732845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.334429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCGT CGGCACCTGG TTCGTCCACC GAACCGCAGA CCCCGGGAAG CGCCCGCCAC 
CCGCGCGCGT TCGAACCCGT CGTCGTCATC GTCACCGTCC TGGTGAGCCT CCTCGGGGCG
GTGATCGGCA TCCACATGAT CACGACGCTC GGGGTCTCGC CCAACACCAG CGTCATCGGC
GCGGTCGTGG CCATGCTCAT CGGCCGGATC GGGTTCCTGG GGCTGCGCTC GATGCGCAAC
ACCAACCGGC AGAACCTCAT CCAGTCCTCG ATCTCCGGCG CGACCTTCGC CTCGGCGAAC
TCCCTGCTCA CCCCGATCGC CATCCCGTTC CTCTTCGGCC GCCCCGACCT GGTGTGGCCG
ATGCTGCTGG GCGCGTCCCT GGGCCTGCTC ATCGACGTGT TCGTGCTCTA CAAGGCGTTC
GGCTCCCGGT TCCTGCCCGC CGACGCGGCC TGGCCGCCCG GGGCGGCGGC GGCCGAGACC
ATCAAGGCCG GTGACCGGGG CGGACGCCAG GCGGCCATCC TGGTGGGCGG CGGCGCGGTC
GGGCTCGGCG CCTCCTTCCT GGGCATGCCG ATGTCGGCGG CGGGCATCGC CATGATCGGC
AACGTCTGGG CCCTGCTCAT GTTCGCCGTG GGCCTGCTCG TCGCCCAGTA CTCCCCGGCG
GTCATCGGGA TCGACCTCAA CTCGATCTAC GTGCCGCACG GCGTCATGAT CGGCGCGGGC
GTGGTCGCGC TGGCGCAGAT CGTGGTCATC CTCGCCGGCC GCCAGAGCCG CAGGGAGAGG
GAGCGCGAGG CCGCCCGCGA CCGCGCCGCC CAGGACGACC CGTCCCTGGC CTACACCGTG
GACCGCGCCA CCCTGGGCCG GGCCCTGGGC TCGGGCTACG TGCTGTTCGC CCTCGGTGCC
CTCGTGCTCG CGGTCACCGG CGGGATCTGG GCGGACATGA GCTGGCTGGG CATCCTCGGA
TTCGTCCTGT TCGCCGCCGT GGCCGCCCTG GTCCACGAAC TCATCGTCGG CCTGGCCGCC
ATGCACGCGG GCTGGTTCCC CGCCTTCGCG GTCACCCTCA TCTTCCTCAT CCTCGGCCTG
GCGCTGGGCA TCCCCGGGGT GCCGCTGGCC CTGCTCGTGG GCTACTGCGC GGCCACCGGC
CCCGCCTTCG CGGACATGGG CTACGACTTC AAGGCCGGGT GGGTGCTGCG CCGCGACCGC
CGCCCCTACA CCGCCTTCGA GCTCGACGGA CGCCGCCAGC AGCTCATCTC CTCCATGATC
GGGTTCGCCG TCGCCATCGG CATGGTCGCG CTGCTCTGGC AGGGCCTGTT CGAGGACGGC
GCCGTGCCGC CCACCTCGAT CGTCTACGCC GACACCATCA AGGCCGGGCT GAGCGACCCC
TCCGTCCTGC TCCAGCTCGC CCTGTGGGCC GTGCCCGGCG CGATCGTGCA GCTCCTGGGC
GGCCCCCGGC GCCAGATGGG CGTCCTGCTC GCCACCGGCC TGCTCGTGGC CACGCCCAAC
GCCGGATGGC TCGTCCTGGC CGGGCTGGCG ATCCGCCTGG TGTGGGAGCG CCGCCGCGGC
GAGAAGGGCG AGCAGGAGAT CGCCCTGGTC GGCGCCGGGC TCATCGCCGG GGACTCCGTC
CACTCCGTCG GCACCGTCTT CAGCCGCTGA
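
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before a figure such as the GC content can be recomputed. This is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGGAACCGT CGGCACCTGG TTCGTCCACC GAACCGCAGA CCCCGGGAAG CGCCCGCCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq)))
```

Passing the full 1650-base sequence to `gc_percent` gives the value to compare against the GC content field.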
 
Protein sequence
MEPSAPGSST EPQTPGSARH PRAFEPVVVI VTVLVSLLGA VIGIHMITTL GVSPNTSVIG 
AVVAMLIGRI GFLGLRSMRN TNRQNLIQSS ISGATFASAN SLLTPIAIPF LFGRPDLVWP
MLLGASLGLL IDVFVLYKAF GSRFLPADAA WPPGAAAAET IKAGDRGGRQ AAILVGGGAV
GLGASFLGMP MSAAGIAMIG NVWALLMFAV GLLVAQYSPA VIGIDLNSIY VPHGVMIGAG
VVALAQIVVI LAGRQSRRER EREAARDRAA QDDPSLAYTV DRATLGRALG SGYVLFALGA
LVLAVTGGIW ADMSWLGILG FVLFAAVAAL VHELIVGLAA MHAGWFPAFA VTLIFLILGL
ALGIPGVPLA LLVGYCAATG PAFADMGYDF KAGWVLRRDR RPYTAFELDG RRQQLISSMI
GFAVAIGMVA LLWQGLFEDG AVPPTSIVYA DTIKAGLSDP SVLLQLALWA VPGAIVQLLG
GPRRQMGVLL ATGLLVATPN AGWLVLAGLA IRLVWERRRG EKGEQEIALV GAGLIAGDSV
HSVGTVFSR
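
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an assumption-laden illustration, not the database's own method: it builds the codon table shared by the standard code and translation table 11 (the two differ in permitted start codons, not in codon meanings), and it does not force an alternative start codon to methionine, which is harmless here because the gene starts with ATG. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove block spacing and newlines
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence.
print(translate("ATGGAACCGT CGGCACCTGG TTCGTCCACC"))  # MEPSAPGSST
print(translate("CACTCCGTCG GCACCGTCTT CAGCCGCTGA"))  # HSVGTVFSR
```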