Gene Ndas_2980 details


Gene Information

Locus tag: Ndas_2980
Symbol:
ID: 9246833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3558379
End bp: 3560052
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: TAP domain protein
Protein accession: YP_003680896
Protein GI: 297561922
COG category:
COG ID:
TIGRFAM ID:
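
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3558379
END_BP = 3560052
GENE_LENGTH_BP = 1674
PROTEIN_LENGTH_AA = 557


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 3560052 - 3558379 + 1 = 1674 bp,
# and 1674 / 3 - 1 = 557 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```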


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.085257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGTGCG GGGTGGGTGC CCCCGGGTTC GGTGGGCTCA TGGCGGTGGT GCTGGCCGCC 
GCCTGCTCTC CTCCCGCCGA CGACCCCTTC CTCGGCCAGG ACGTGGCCTG GCGGCCCTGT
CACGAGAACG GGCGGACCAC CGAACCCTCC GACCACGACA CCGAACCCTC CGAGCGGCGG
GCCACACCCA CCACGCGCGC CATCGGCCCC GCCGACCGCG GATCCGACAC GGAGTGGCTG
GAGAGCCTGG AGTGCGGCAC GGTCACGGTC CCCCTGGACC ACGACGAGCC CGGCGGCAGG
ACCCTCGACA TCGCCCTGGT CCGCGCGCCC GCCAGCGGCC CGCCGCAGGA GCGCCTGGGC
TCCCTGGTCG TCAACCCGGG CGGCCCCGGC GCCTCCGGGG TCCGGGCCCT GGACACGCCC
CTGTTCGGCG ACGACGTCCG CGCCGCCTTC GACCTCGTCT CCTTCGACCC GCGCGGCGTC
GGGGACAGCG GCGGCTTCGC CTGCGGGGAC CGGTACGCCC TGGTCGAGGC GCGCCAGGGC
GTGGCCGGTA CCGACCCCGG CGACCTGGAC GCCGTCGAGC TCCAACCCCT GGAGGACGCC
GCCCGCAGGT ACGCCGGGGC CTGCGCGCAG ACCGTGGGGG AGGAGTTCCT GACCCGCCTC
GGCACCGTCC ACGTCGCCCG TGACCTCGAC ATCGTCCGCG ACGCCCTGGG CGAGGAACGG
CTGAGCTTCG TCGGCCACTC CTACGGCGCC CACCTGGGCG CCCTCTACGC CCACCTGTAC
CCCGACCGCG TCCGGGCGGC GGTCCTGGAC GGCGCCGCGG CGCCCGGCAC CTCCAACGCG
CGGGCCGCCG TCGAGCAGGT CGCCGCCCTC CAGGGCACCT GGAACGCGTT CGTGGCCCAC
TGCGCCCGCG ACGCGGCCTG CCCGTTCGCC GGAGCGGACC GCGCCCCCGG CTCCGGCGGA
GCCGGAACCC TCAGCGAGGC CGACGTCCGG GCGGCCGCGC TCCTGCGCGG CCTCGACCGC
GTCCCCGCCG AGGCCGAGGG GATCCCGGTG GACGGGCGCG CGCTGACGGC CATGGTGGTC
ATGGAGCTGT ACCGCGAGGA CGGCTGGGAC CTGCTCGCCG ACCTGTTCAC CGCCCTCGCC
GGGGACGACG CCGAGGGGAC CGCGCTCCAC CTCGGGCGGC TCCACGACCG CACCTTCGGC
GCGTACGCGC GGGCCGGCTC CGACGAGGCC GTACCGGGAA CGGAGCACCA GGACCCGGGC
GCGGTGTTCA CCGCCGTGAA CTGCGCGGAC CGCGCCGACC CGGCGACCGT GGAGGCCTAC
CGCGACGCCG CCGACGAGGC GGCCGACCTC GCCCCGCTCT TCGGCCCCGA CCCGGTGTGG
GACCACCTGC CCTGCGCGTA CTGGCCCGAG ACGGGGGAGG CCCCCGCGGC GACCGCGCCG
GACGCGCCCC CGATCGTGGT CGTGGGCGCC GTCGGAGACC CCGCCACCCC CTACGCCTGG
GCCGAGGACC TGGCCGAACG GATGGAGAGC GCCACCCTGG TCACCTACGA CGGGGCGGGG
CACACCGTCT ACGGCACGGG CCGCAGCCCG TGCGTGGACG AGGCGGTGGA CGCCTACCTG
CTCACCGGCG AGGTCCCCGA ACCCGGGCTC ACCTGCCCCG GCACACTCGA CTGA
 
Protein sequence
MRCGVGAPGF GGLMAVVLAA ACSPPADDPF LGQDVAWRPC HENGRTTEPS DHDTEPSERR 
ATPTTRAIGP ADRGSDTEWL ESLECGTVTV PLDHDEPGGR TLDIALVRAP ASGPPQERLG
SLVVNPGGPG ASGVRALDTP LFGDDVRAAF DLVSFDPRGV GDSGGFACGD RYALVEARQG
VAGTDPGDLD AVELQPLEDA ARRYAGACAQ TVGEEFLTRL GTVHVARDLD IVRDALGEER
LSFVGHSYGA HLGALYAHLY PDRVRAAVLD GAAAPGTSNA RAAVEQVAAL QGTWNAFVAH
CARDAACPFA GADRAPGSGG AGTLSEADVR AAALLRGLDR VPAEAEGIPV DGRALTAMVV
MELYREDGWD LLADLFTALA GDDAEGTALH LGRLHDRTFG AYARAGSDEA VPGTEHQDPG
AVFTAVNCAD RADPATVEAY RDAADEAADL APLFGPDPVW DHLPCAYWPE TGEAPAATAP
DAPPIVVVGA VGDPATPYAW AEDLAERMES ATLVTYDGAG HTVYGTGRSP CVDEAVDAYL
LTGEVPEPGL TCPGTLD
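
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any computation. The sketch below is an illustration only, not the database's own method: it strips the spacing, computes a GC fraction that could be compared with the GC content reported in the record, and translates a CDS with the codon assignments of translation table 11. The helper names and the set `ASSUMED_START_CODONS` are hypothetical; the set lists only three commonly used bacterial start codons, and a start codon in first position is read as M.

```python
# Codon assignments for translation table 11, listed in TCAG order
# (the amino acid assignments match the standard genetic code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the start codons permitted by table 11.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame; the first codon is read as M if it is an
    assumed start codon, and one trailing stop codon is dropped."""
    seq = clean_sequence(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in ASSUMED_START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# The first 30 bases of the gene sequence give the first 10 residues
# of the protein sequence listed in this record.
print(translate_cds("GTGCGGTGCG GGGTGGGTGC CCCCGGGTTC"))  # MRCGVGAPGF
```

Passing the full gene sequence to `translate_cds` and comparing the result with the cleaned protein sequence is one way to check that the two listings agree.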