Gene Arth_1078 details


Gene Information

Locus tag: Arth_1078
Symbol:
ID: 4446416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1164864
End bp: 1165841
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 70%
IMG OID: 639688884
Product: dihydrodipicolinate synthetase
Protein accession: YP_830572
Protein GI: 116669639
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID:
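
The length fields above follow from the coordinates. Assuming the start and end positions are 1-based and inclusive (an assumption about this record's convention, which the listed numbers are consistent with), the gene length is end - start + 1, and the protein length is the number of codons minus one stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1164864, 1165841)
print(length)                  # 978
print(protein_length(length))  # 325
```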


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACCC CGGCACGCGA GTCACTGTCA CCGGGCGTCT GGGGCGTGGT GGCAACACCC 
TTCCAGGGCA GCACGCTGGA CGTGGACCTG GACAGCCTGT CCGGGCTCGT GGAGCACTAC
GAAGCTATCG GCGCTACCGG GCTGACCGTC CTGGGTGTGT TCGGCGAGGC CGCGGCGCTG
ACCGCGGAGG AACGCCGCCA GGTCCTTGAC ATCGCGGTCG AATGCACGGA CCTCCCGCTG
GTTGTCGGCA TCACTGCGCT GTCCGCCCGC CCCGCCATCG ACGAAATCAA GGCTGCCCAG
GCCGTGGCCG GCGAGCGCCT GGCCGCCGTG ATGGTCCAGG CCAACTCGCC CCGGCCCGAG
GTGGTGATCG CCCACCTGGA CGCCATCCAC CGGGCCACCG GCGCCAAGGT GGTGCTGCAG
GATTACCCCC TGGCCAGCGG CGTCAGCATC AGCACCCCTG CACTCATTTC CGTGGCGAAG
TGGTGCAGCT TCGTGGTCGC GGTGAAGGCG GAAGCGCCGC CCACCAGCGT GGCCATCGCG
GAGCTGACAG CCGCGCTGGT TGGCAGGGTG TCCGTCTTCG GCGGGCTCGG CGGGCAGGGA
CTGCTGGACG AGCTCATGGC CGGCGCGGCC GGCGCCATGA CCGGTTTCTC CTACCCGGAG
GCGCTGATCG CCTGTGTCCG GGCCTGGCAG CGCGACGGCT ATGAAGCCGC CCGCGACCAG
CTGCTGCCGT ACCTGCCGCT GATCAACTTC GAGCAGCAGG CAAAGATCGC CCTGGCCGTC
CGTAAGGAAT GCCTGCTGAA GCGCGGCTTA GTCAAGGACG CGGGCGTCCG GGCTCCGGCC
GCGGAGTTCC CGGAGAGGCT GCGCTACGGC ATGCTCACGC ACCTGCGGGA AGCCGCTGCC
GCGCTGGAAG CACGCGCGGA CGTACCCGCC GGTGCACCGC ACATTGCGGA CCACCACAGC
TCAGTAGGGA GTTTCTGA
 
Protein sequence
MDTPARESLS PGVWGVVATP FQGSTLDVDL DSLSGLVEHY EAIGATGLTV LGVFGEAAAL 
TAEERRQVLD IAVECTDLPL VVGITALSAR PAIDEIKAAQ AVAGERLAAV MVQANSPRPE
VVIAHLDAIH RATGAKVVLQ DYPLASGVSI STPALISVAK WCSFVVAVKA EAPPTSVAIA
ELTAALVGRV SVFGGLGGQG LLDELMAGAA GAMTGFSYPE ALIACVRAWQ RDGYEAARDQ
LLPYLPLINF EQQAKIALAV RKECLLKRGL VKDAGVRAPA AEFPERLRYG MLTHLREAAA
ALEARADVPA GAPHIADHHS SVGSF
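
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python example under stated assumptions: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares; table 11 differs in the start codons it permits), it handles only the bases A, C, G and T, and the `translate` helper is illustrative rather than taken from any library. It is applied here to the first 60 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGGACACCC CGGCACGCGA GTCACTGTCA CCGGGCGTCT GGGGCGTGGT GGCAACACCC"
print(translate(first_line))  # MDTPARESLSPGVWGVVATP
```

Passing the full 978 bp gene sequence to the same function should give a 325-residue string that can be compared with the protein sequence listed above once the spaces are removed.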