Gene NATL1_19731 details


Gene Information

Locus tag: NATL1_19731
Symbol: alr
ID: 4779826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1629295
End bp: 1630503
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 36%
IMG OID: 640085264
Product: alanine racemase
Protein accession: YP_001015793
Protein GI: 161407967
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0787] Alanine racemase
TIGRFAM ID: [TIGR00492] alanine racemase
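
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check, using illustrative variable names that are not part of the record:

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 1629295
end_bp = 1630503

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons. The final codon is the
# stop codon, which contributes no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under that assumption the script prints 1209 0 402, matching the Gene Length and Protein Length fields.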


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGTA TGTCAAAAAT CAATAGAAGT GAGAGATCTT TCACTTTTGG TGGGTTAACT 
GAAAATGTAA TTAGCCCTGA TCCTCGCAGT CGTGCATGGG TTGAGGTCGA TTCGAAAATT
ATCGAAAATA ATGCAAGAGT TTTGAAAAAC TTTATTGGCG ATGATTGTTC ATTGATGGCA
GTTGTGAAAG CTGATGGATA CGGACATGGT GCAGAGACTG TTGCTAAATC TGCGTTAACT
GGAGGTGCAG ATAGTCTTGG AGTAGCAACT TTAGAAGAGG GTATTCAATT AAGGAATGCT
GGCTTGAAGT GTCAGATATT AATTCTTGGA AATCTAATTA ATTCAGAAGA ACTTTATTCT
TCCTTTTGCT GGGATCTTAT TCCTACGATT AGTGGAATTC GTGAGGCAAT AATTTGCAAT
AATATCGCTG AAAATAAGCA TAAAAAATTT TTTATTCACT TAAAAGTTGA TACGGGTATG
ACGAGACTTG GCTGTGATTG TGATGAAGTA AAAGACTTGA TTTCTAAAAT TGATTATTTA
GAAAATATAT CCCTAAAGGG TATATATAGT CATTTAGCAA TTGCTGATAA AGATTTAGAG
AAGAATACAA AACTAAATTT TACACAAATT CAGGTAACTA GATTTGAAAA GGTTTTAAAG
GATTTGGGTG CTAGAAATAA GTCTTTATGT AGACACCTTG CAAATTCTGC GGGGACTCTT
TCAGATAGTC GTCTTCATTT TGACATGGTT CGAGTAGGAC TAAGCTTATA TGGTTATTTT
CCTGTAAATG ATTTTGAGTC TGATTTGAGA CTTAAACCTG CCTTGAAGGT CAAGTCTCGA
GTTACTTTGG TTAGGGAGGT AGAAAAAGGA ATAGGAGTGG GGTATGGACA TTTTTTTAAA
ACTCAAAGGA AAAGTAAACT TGCTGTAGTT GCTATTGGCT ACGCAGATGG CGTCAGCAGA
AATCTTTCTG GGAAAATATC AGCCTCAATA GATGGGGTTT TGGTTCCCCA AGTAGGTGCA
ATTGCAATGG ATCAAATGGT TTTTGACATA ACCGATAAAC CAGATATTAG AACAGGTCAA
GTTTTAACCC TTCTTGGTAC TGATGGTGCA GTTTGTATTT CACCCCAAAA TTGGTGTGAT
TTATCTGGAT CAATTCCATG GGAAGTTCTT TGTGGTTTCA GAAATCGTCT TCCTCGAGTT
GTCACTTAA
 
Protein sequence
MQSMSKINRS ERSFTFGGLT ENVISPDPRS RAWVEVDSKI IENNARVLKN FIGDDCSLMA 
VVKADGYGHG AETVAKSALT GGADSLGVAT LEEGIQLRNA GLKCQILILG NLINSEELYS
SFCWDLIPTI SGIREAIICN NIAENKHKKF FIHLKVDTGM TRLGCDCDEV KDLISKIDYL
ENISLKGIYS HLAIADKDLE KNTKLNFTQI QVTRFEKVLK DLGARNKSLC RHLANSAGTL
SDSRLHFDMV RVGLSLYGYF PVNDFESDLR LKPALKVKSR VTLVREVEKG IGVGYGHFFK
TQRKSKLAVV AIGYADGVSR NLSGKISASI DGVLVPQVGA IAMDQMVFDI TDKPDIRTGQ
VLTLLGTDGA VCISPQNWCD LSGSIPWEVL CGFRNRLPRV VT
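
The protein sequence can be reproduced from the gene sequence by translating it codon by codon and stopping at the first stop codon. The sketch below uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The function name `translate` and the compact table layout are illustrative choices, not something the record specifies.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering, third base varying
# fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCAGAGTA TGTCAAAAAT CAATAGAAGT"))
```

For those first 30 bases the function returns MQSMSKINRS, the first block of the protein sequence. The same call accepts the full gene sequence pasted with its spaces and line breaks, since whitespace is stripped before translation.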