Gene Apre_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1444 
Symbol 
ID8398254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1562883 
End bp1563893 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content42% 
IMG OID644995809 
Productasparagine synthetase AsnA 
Protein accessionYP_003153188 
Protein GI257066932 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2502] Asparagine synthetase A 
TIGRFAM ID[TIGR00669] aspartate--ammonia ligase, AsnA-type 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGT TAGTTATACC AGAAAATTAT AAAAGCAATG AAGATTTATA TAGAACCCAA 
CTGTTGATCA AGGAAATCAA AGACTATTTC CAAATCAACC TAGCCAACAA CCTAAACCTA
AAGCGTGTAT CTGCTCCCCT ATTCGTTTCA GAAACATCAG GTCTTAACGA TAACCTAAAC
GGAGTAGAAA AACCTGTAAC CTTCGACCTT CCAGAAGCTC ACAACGCCGA GATGGAAATC
GTCCACTCCC TTGCCAAATG GAAGAGATAC GCCCTAGAAG AATACAACTT CAAAACCCAC
GAGGGCCTTT ACACAGACAT GAACGCCATA AGACGCTGCG AAGAACCAGA CAACACCCAC
TCCTTCTACG TCGACCAATG GGACTGGGAA CTAATCATGA ATGAAGAAGA CAGAAACGTG
GACTACCTCA AACAAATCGT AGAAACAATC TACAGGACTA TGAAATCCCT AGACGAATAC
CTCTGCACTC TTATTCCAAC TAGACAAAAG CTCCTCAAAG ATCAAATTAG ATTTATGACA
AGCGAAGAGC TCCTCCAAAA ATATCCAGGC AAAAACGATA AGGAAAGAGA AAGATTAGCG
GTCAAAGAAT ACGGAGCAGT TTTCCTAATG CAAATAGGAA AAGTCCTATC AAACGGAGAA
AAACACGACC TCCGTGCCCC AGACTACGAC GATTGGGAAC TAAACGGGGA CATCCTTGTA
TATAACCCTG TACTAGACGA TGTCCTAGAA CTATCATCCA TGGGCATCAG AGTCAACCCA
GAAAGACTAA ACGAGCAACT AAAACAAACA GACAACCTAG ACAGACTAAA ATTCGACTAC
CACAGGATGC TAATAGACGG CAAACTCCCA CAAACCATAG GAGGCGGAAT CGGCCAATCA
AGACTATGTA TGTTCTTCCT CCAAAAAGCC CACATAGGAG AAGTCCAAGT ATCCTACTGG
CCAGACGAAC AAAGAAAAGC TCTAGCCAAC AAGGGAATCA AACTATTATA G
 
Protein sequence
MSKLVIPENY KSNEDLYRTQ LLIKEIKDYF QINLANNLNL KRVSAPLFVS ETSGLNDNLN 
GVEKPVTFDL PEAHNAEMEI VHSLAKWKRY ALEEYNFKTH EGLYTDMNAI RRCEEPDNTH
SFYVDQWDWE LIMNEEDRNV DYLKQIVETI YRTMKSLDEY LCTLIPTRQK LLKDQIRFMT
SEELLQKYPG KNDKERERLA VKEYGAVFLM QIGKVLSNGE KHDLRAPDYD DWELNGDILV
YNPVLDDVLE LSSMGIRVNP ERLNEQLKQT DNLDRLKFDY HRMLIDGKLP QTIGGGIGQS
RLCMFFLQKA HIGEVQVSYW PDEQRKALAN KGIKLL