Gene Ndas_5546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5546 
Symbol 
ID9249449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp743523 
End bp744641 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content76% 
IMG OID 
ProductLAO/AO transport system ATPase 
Protein accessionYP_003683431 
Protein GI297564458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGAGC AGGGTTCCGT GCCGGGCGCG CGGCGTTCGG GGCGGCGCCG CCCGGTGGAC 
GTGGACGCCC TCGCCGAGGG GGTCCTGGCC GGGCACCGGC CCACCCTCGC GCGGGCGATC
ACCCTCGTGG AGTCCCGCCG CCCGGACCAC GCGCGCGCCG CGCAGGAACT GCTGGTGCGG
CTGCTGCCGC ACACCGGCGG GGCCCGCCGG GTGGGGATCA CCGGGGTCCC CGGCGTCGGC
AAGTCCACGT TCGTCGACGC GCTGGGCACC CGGCTGACCA AGGCCGGGCA CCGGGTGGCG
GTGCTGGCCG TGGACCCCTC CTCCACGCGC ACCGGAGGCA GCATCCTGGG TGACAAGACC
CGGATGGAGC GCCTGGCGGT GGACCCCGAC GCGTTCATCC GGCCCTCGCC CACGGCGGGG
ACCCTGGGCG GCGTCGCCCG GGCCACGCGG GAGACCATGC TGCTCATGGA GGCGGCCGGG
TTCGACGTGG TCCTGGTGGA GACGGTGGGC GTCGGCCAGT CCGAGGTCGC GGTCGCCGCC
ATGGTGGACT GCTTCTGCTT CCTCACGCTG GCCCGCACGG GCGACCAGCT CCAGGGCATC
AAGAAGGGCG TGCTGGAGCT GGTGGACGTG GTCGCCGTCA ACAAGGCCGA CGGACCGCAC
GCCGACGACG CGCGCAAGGC GGCCCGCGAG CTGTCGCGGG CGCTGCGGCT GCTCCAGCCG
GTGCACCCGG ACTGGCGTCC GCCGGTACTC ACGTGCAGCG GTCTGACCGG CGACGGGCTC
GACGAGGTGT GGGACGCCGT GACCCGGCAC CGCGCGGTGC TGGAGAGCGA CGGCGCGCTG
GCCGAGCGCC GCAGCCGCCA GGGGGTCAGC TGGATGTGGG ACCAGGTCCG CGACCAGCTC
ATGGACGCGT TCCTGCGCGA TCCCCGGGTG GCCTCGCTCC TGCCCGGGAC GGAGGAGCGG
GTGCGGTCGG GGGAGACCAC GGCGACCCTC GCCGCGCGCA CGCTGCTGGA CTCCTTCACC
CGGGGCCGCG GGGTCCGTCC GACGGGTGAA CCCGGAGGCG GCGCGGGGCC GGAGCGCGGC
ACGGAGCCCT CCCGGGAGCC GGGTTCGTCC GACGGGTGA
 
Protein sequence
MAEQGSVPGA RRSGRRRPVD VDALAEGVLA GHRPTLARAI TLVESRRPDH ARAAQELLVR 
LLPHTGGARR VGITGVPGVG KSTFVDALGT RLTKAGHRVA VLAVDPSSTR TGGSILGDKT
RMERLAVDPD AFIRPSPTAG TLGGVARATR ETMLLMEAAG FDVVLVETVG VGQSEVAVAA
MVDCFCFLTL ARTGDQLQGI KKGVLELVDV VAVNKADGPH ADDARKAARE LSRALRLLQP
VHPDWRPPVL TCSGLTGDGL DEVWDAVTRH RAVLESDGAL AERRSRQGVS WMWDQVRDQL
MDAFLRDPRV ASLLPGTEER VRSGETTATL AARTLLDSFT RGRGVRPTGE PGGGAGPERG
TEPSREPGSS DG