Gene Ndas_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3663 
Symbol 
ID9247532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4397423 
End bp4398760 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content72% 
IMG OID 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_003681567 
Protein GI297562593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGA CCGAGGTCGC CCAGTCCCGC CGGATCGTCA CCGAGATCCC CGGCCCCAAG 
TCCCGCGCGA TCCAGGAGCG CCGCCGTTCG GCCGTCGCCC AGGGCGTGGG CAGCGTCCTG
CCGGTCTACG TCGAGCGCGC GGGCGGCGGC ATCGTCGAGG ACGTCGACGG CAACGCGCTG
ATCGACTTCG GCTCCGGCAT CGCCGTGACC AACGTCGGCA ACGCCGACCC GCGCGTGGTG
GAGCGCGCCG CCGAGCAGCT CGGCCGGTTC ACGCACACCT GCTTCATGGT CAACCCGTAC
GAGGCGTACG TGGACGTGTG CGAGGCACTC AACCGGATCA CGCCGGGCGA CCACGAGAAG
CGCTCGATCC TGCTCAACTC GGGCGCCGAG GCGGTCGAGA ACGCGGTGAA GATCGCCCGC
AGCGCGACCG GCCGCCAGGC GGTCGTGGTG TTCGACCACG CCTACCACGG CCGCACCAAC
CTCACCATGG GGCTGACCGC CAAGAACATG CCCTACAAGC AGGGCTTCGG GCCGTTCGCC
GGTGAGATCC ACCGGATGCC GATGGCCTAC CCGTACCGCT GGCCGACGGG CCCGGACAAC
TGCGGCCCCG AGGCGGCGGC CATGGTGATC GAGCAGATCA CCAAGCAGAT CGGCGCCCAG
AACGTGGCGG CCGTGGTGAT CGAGCCGATC CAGGGCGAGG GCGGCTTCAT CGAGCCCGCC
CCCGGCTTCC TGCCCGCGGT GGTGGAGTTC TGCCGCGCCA ACGGCATCGT GTTCGTCGCC
GACGAGGTGC AGACCGGCTT CGCCCGCACC GGCCACATGT TCGCCAGCGA GCACGAGGGC
GTGGTCCCGG ACCTGATCAC GACCGCCAAG GGCATCGCGG GCGGCCTGCC GCTGGCCGCG
GTGACCGGCC GCGCCGAGCT GATGGACGCC GTGCACGGCG GCGGCCTGGG CGGCACCTAC
GGCGGCAACC CGGCCGCGTG CGCCGCCGCG CTGGCCGCGC TGTCGGCGAT CGAGTCCGAC
GGCCTGGTGG AGCGCGCCCG TGAGATCGGC GAGCTGATGC TGGGCCGCCT GCGCGAGCTG
GCCGCCAAGT ACGAGGTCAT CGGCGACGTG CGCGGACGCG GCGCGATGAT CGCGATCGAG
CTGGTCCAGG ACGCCGACCG CACGCCCGCC CCCGAGGCGC TGGCCAAGGT CCTGTCCTAC
TGCCACTCCC GCGGCCTGGT CCTGCTGAGC GCGGGCACCT ACGGCAACGT GATCCGCATG
CTGCCGCCGC TGGTGATCGG CGACGAGCTG CTGCACGAGG GCCTGGACAT CCTGGAGGAG
GCCTTCGCCC GGCTGTAG
 
Protein sequence
MAATEVAQSR RIVTEIPGPK SRAIQERRRS AVAQGVGSVL PVYVERAGGG IVEDVDGNAL 
IDFGSGIAVT NVGNADPRVV ERAAEQLGRF THTCFMVNPY EAYVDVCEAL NRITPGDHEK
RSILLNSGAE AVENAVKIAR SATGRQAVVV FDHAYHGRTN LTMGLTAKNM PYKQGFGPFA
GEIHRMPMAY PYRWPTGPDN CGPEAAAMVI EQITKQIGAQ NVAAVVIEPI QGEGGFIEPA
PGFLPAVVEF CRANGIVFVA DEVQTGFART GHMFASEHEG VVPDLITTAK GIAGGLPLAA
VTGRAELMDA VHGGGLGGTY GGNPAACAAA LAALSAIESD GLVERAREIG ELMLGRLREL
AAKYEVIGDV RGRGAMIAIE LVQDADRTPA PEALAKVLSY CHSRGLVLLS AGTYGNVIRM
LPPLVIGDEL LHEGLDILEE AFARL