Gene Ndas_5213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5213 
Symbol 
ID9249106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp364361 
End bp365626 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID 
Product2,4-diaminobutyrate 4-transaminase 
Protein accessionYP_003683099 
Protein GI297564126 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.323098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCT TCCAGCGCCT GGAGTCCGAA GTCCGCGGAT ACTGCCGGAA CTGGCCGGTC 
GTGTTCGACC GGGCCGTCGG TAGCCACGTC TACTCCGAGG ACGGCAAGCC CTACCTCGAC
TTCTTCGCGG GTGCGGGGTC GCTCAACTAC GGGCACAACA ACCCCGAGCT GAAGACCTCG
CTCATCGAGT ACCTGACCGA CGACAAGATC GTGCACAGCC TCGACGCCTA CAGCGTGGCC
AAACGCGAGT TCCTGAAGAC CTTCGAGGAG ATCATCCTCA AACCCCGGGG CCTCGACTAC
AAGGTCCAGT TCCCCGGACC CGCGGGCAAC AACGCGGTCG AGGCCGCGCT CAAGCTGGCC
CGCAAGTACA CCGGTCGCGA GACCATCGTC AACTTCACCA ACGGCTTCCA CGGCATGACC
CTGGGCGCCC TGGCCGTCAC CGGCAACTCG ATGAAGCGCG GCGGCGCGGG CGTGCCGCTG
GGCCACGTCG CCACGATGCC GTTCGACAAC TACCTGGACG GCAAGACGCC GGACTTCCTG
TGGCTGCGCA GCCTGCTGGA CGACAGCGGC AGCGGCCTGG ACAAGCCCGC GGCCGTCATC
GTCGAGACGG TCCAGGGCGA GGGCGGCATC AACGCCGCCA GCGCCCAGTG GCTGCGCGAG
CTCTCGGACC TGTGCCGCGA GTACGGCATC CTCATGATCG TCGACGACAT CCAGATGGGC
TGCGGCCGCA CCGGCGACTT CTTCAGCTTC GAGGAGGCCG GGATCACCCC GGACATCGTC
ACGCTGTCCA AGTCCATCAG CGGCTACGGC CTGCCCATGG CCCTCACCCT GTTCAAGCGC
GAGCTGGACG TGTGGGAGCC GGGTGAGCAC AACGGCACCT TCCGCGGGTT CAACCCGGCC
ATGGTGACCG CCGTCGGGGC CCTGCGCCGC TACTGGAGCG ACTCGGCCTT CTCCGACTCC
GTCAAGGCCA AGGGCGACAT GGTCGCCGCC CGCCTGGCCG AGATGGCCGC CGAGCACGCC
GAGTTCGGCG CGCACGTGCG CGGCCGCGGC CTGGCCCGGG GCCTGGCCTT CGAGCAGACC
GACATCGCCA AGAAGGTCGC CGCCGAGTCC TTCGAGCGGG GCCTGCTCCT GGAGACCTCC
GGCCCCGAGG ACGAGGTGGC CAAACTCCTG CCGCCGCTCA CGGCGAGCGA GGAGGAACTC
ACGGCCGGTC TTGACATCAT GGCTGACGCG GCCCGCGCCG CGGTCAAGGC GGCCCAGCCC
GCCTAG
 
Protein sequence
METFQRLESE VRGYCRNWPV VFDRAVGSHV YSEDGKPYLD FFAGAGSLNY GHNNPELKTS 
LIEYLTDDKI VHSLDAYSVA KREFLKTFEE IILKPRGLDY KVQFPGPAGN NAVEAALKLA
RKYTGRETIV NFTNGFHGMT LGALAVTGNS MKRGGAGVPL GHVATMPFDN YLDGKTPDFL
WLRSLLDDSG SGLDKPAAVI VETVQGEGGI NAASAQWLRE LSDLCREYGI LMIVDDIQMG
CGRTGDFFSF EEAGITPDIV TLSKSISGYG LPMALTLFKR ELDVWEPGEH NGTFRGFNPA
MVTAVGALRR YWSDSAFSDS VKAKGDMVAA RLAEMAAEHA EFGAHVRGRG LARGLAFEQT
DIAKKVAAES FERGLLLETS GPEDEVAKLL PPLTASEEEL TAGLDIMADA ARAAVKAAQP
A