Gene Arth_3021 details

Gene Information

Locus tag: Arth_3021
Symbol:
ID: 4444388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3388517
End bp: 3389647
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 65%
IMG OID: 639690845
Product: aspartate-semialdehyde dehydrogenase
Protein accession: YP_832500
Protein GI: 116671567
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0136] Aspartate-semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01745] aspartate-semialdehyde dehydrogenase, gamma-proteobacterial
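
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3388517
end_bp = 3389647
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1131 0 376
```

Under these assumptions the reported gene length and protein length agree with the coordinates.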


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.458955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACAG CAGCTACCCC CTCCGTCGGC CTGGTCGGAT GGCGCGGCAT GGTCGGCTCC 
GTCCTGATGC AGCGCATGCA GGATGAAGGC GACTTCGCCA GCATCAACCC GGTGTTCTTC
TCCACCTCCA ACGCCGGAGG TGCGGCGCCA TCTTTTGCGG AAGGCGCAGG CAAGCTCGAG
GACGCGTTCG ACGTCGACAC GCTGGCTAAG CTGCCGATTA TTGTTACCGC CCAGGGCGGC
GATTACACCA AGCAGGTCCA CGGTGAGCTG CGCAGCCGCG GCTGGGACGG CCTCTGGATC
GACGCCGCCT CCACCCTGCG AATGAATGAC GACTCGATCA TCGTGCTGGA CCCCATCAAC
CGCGACGTCA TCGACAAGGG CCTCGCCAAC GGCACCAAGG ACTTCATCGG CGGAAACTGC
ACCGTGTCCT GCATGCTGAT GGGCCTGGGC GGCCTGTTCA AGAACGGCCT CGTCGAGTGG
GGCACCTCCA TGACCTACCA GGCTGCCTCC GGCGGCGGCG CCCGCCACAT GCGCGAGCTG
CTCAGCCAGT TCGGTACGCT CAACGCGGAG GTCAGCTCGG AACTGGACGA CCCGGCGTCG
GCCATCCTGG AAATTGACCG CAAGGTCCTG GCACACCAGC GCACCGACAT CGACGCGACG
CAGTTCGGCG TCCCGCTGGC CGGCTCGCTG ATCCCTTGGA TCGACGCGGA CCTCGGCAAC
GGCCAGTCCA AGGAAGAGTG GAAGGCCGGG GTCGAGACCA ACAAGATCCT CGGAACCTCC
GGCGAAAACC ACATCATCAT GGACGGCCTG TGCATCCGGA TCGGTGCGAT GCGTTCGCAC
TCCCAGGCCC TCACGCTCAA GCTCCGCGAA GACCTCTCCG TGGCCGAGAT CGAGAAGCTC
CTGGACGCGG ACAACGAATG GGCCAAGGTG GTGCCCAACA CCAAGGAAGA CTCCATGGCG
AGCCTGACCC CGGTGGCCGC CTCAGGGACG CTGGACATCC CGGTGGGCCG TATCCGCAAG
CTCGAAATGG GCCCGGCTTA CATCAGCGCC TTCACTGTGG GTGACCAGCT CCTCTGGGGC
GCCGCCGAAC CGCTGCGCCG CATGCTCAAC ATCGCCACCG GCACGCTCTA A
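
The gene sequence is printed in space-separated groups of 10 bases, so it needs cleaning before any computation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_percent`) strip the grouping whitespace and compute a GC percentage. Only the first line of the sequence is used here as a short example; running the same helpers on the full sequence would give a value to compare against the reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used only as a short example fragment.
first_line = "ATGACTACAG CAGCTACCCC CTCCGTCGGC CTGGTCGGAT GGCGCGGCAT GGTCGGCTCC"
fragment = clean_sequence(first_line)

print(len(fragment), round(gc_percent(fragment), 1))
```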
 
Protein sequence
MTTAATPSVG LVGWRGMVGS VLMQRMQDEG DFASINPVFF STSNAGGAAP SFAEGAGKLE 
DAFDVDTLAK LPIIVTAQGG DYTKQVHGEL RSRGWDGLWI DAASTLRMND DSIIVLDPIN
RDVIDKGLAN GTKDFIGGNC TVSCMLMGLG GLFKNGLVEW GTSMTYQAAS GGGARHMREL
LSQFGTLNAE VSSELDDPAS AILEIDRKVL AHQRTDIDAT QFGVPLAGSL IPWIDADLGN
GQSKEEWKAG VETNKILGTS GENHIIMDGL CIRIGAMRSH SQALTLKLRE DLSVAEIEKL
LDADNEWAKV VPNTKEDSMA SLTPVAASGT LDIPVGRIRK LEMGPAYISA FTVGDQLLWG
AAEPLRRMLN IATGTL
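
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table in pure Python; it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. It does not handle alternative start codons (which is harmless here, since this gene starts with ATG), and the function name `translate` is illustrative, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as 10-base grouping) is ignored; a trailing partial
    codon is skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, used as a short example.
first_line = "ATGACTACAG CAGCTACCCC CTCCGTCGGC CTGGTCGGAT GGCGCGGCAT GGTCGGCTCC"
print(translate(first_line))  # prints: MTTAATPSVGLVGWRGMVGS
```

The printed residues match the first 20 residues of the protein sequence above.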