Gene OSTLU_38108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38108 
Symbol 
ID5004164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp373421 
End bp374923 
Gene Length1503 bp 
Protein Length500 aa 
Translation table 
GC content62% 
IMG OID640419585 
Productpredicted protein 
Protein accessionXP_001419983 
Protein GI145351221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00699] 4-aminobutyrate aminotransferase, eukaryotic type
[TIGR03251] L-lysine 6-transaminase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0293051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGC GCGCGCTCGA GCGATGCGCG CGACGACGCG CGCGCGAGGC GATTTCGCGC 
GCGTCGCGCG ACGCGCGCGC GTCGACGCGC GCGTCGTCGT CGTCGTCGCG CGCGGGGACG
TCGAACGGTG AACCAGCCGC CCCGGTGGTG CGGACGCCCA TCCCGGGACC GGCGTCGCGA
CGCGCCGTGG AAGCGCTCAG CGCGCACGCG GACGTGGGGT CGATACGATA CTTCGTCGAC
GTCGACGCTT CGCGCGGGAA TTACGTCGTC GACGCGGATG GGAACGCGGT GCTGGACCTG
TACGCGCACA TCGCGTCGCT GCCGGTGGGA TACAACCACG AAAAGATGCT CGCGGCGATG
CGAGACGAAG CGAACGTGGG GATTCTCGCG CACCGACCGG CGCTGGGGAA TAACCCACCG
ATCGGATGGG ACGACAGAGT GGCGCGAACG CTCATGCGAG TGGCGCCGAA AGGGTTGACG
CGCGCGACGA CGATGGCGTG CGGGGCGTGC GCGAACGAAC ACGCGATGAA GGCGGTATTC
ATAAGCGCGG CGAACGCGCG GCGGGGCGGA CGCGAGATAA GCGAGGAAGA AAAGGTGAGC
TGTCTGACGA ATCAAGCGCC CGGGTCGCCG GGATTTAAAG TTTTGTCGTT CGATGGTGCG
TTTCACGGAC GGACGGCGGC GTGTCTGTCG CTGACGCACA CAAAGTGGAT TCACAAGCTT
GATTTCCCGA CGTTTGACTG GCCGTCGTGC CCGTTTCCGA AATTAAAGTA CCCACTGGAT
AAATTTGAGC GAGAAAACGC CGAAGAGGAG GCGCGGTGTC TCGCTGAAGT CGAGAAGGCG
TTGACGCGAG ATCGAGACGT CGTCGCGGTG ATCGTCGAGC CCATGCAAGC GGAGGGCGGC
GATAATCACG CGAGCGCGGA TTTTTTCCGA AAGCTTCGCG CTTTGACGAA GAGAGAAAAC
GTCCGCATGA TCGTCGATGA GGTGCAGACT GGGTGTGGAT CGAGTGGGAC GTTTTGGGCG
CACGAAGCTT GGGGATTAGA ACATCCGCCG GACATTGTGA CGTTTAGCAA AAAAATGCAA
ATCGCGGGAT TCTACGCGGC CGCGGATCTC GCGCCCGAGC TCCCGTACCG CATCTTCAAC
ACGTGGATGG GTGATCCAGC AAAGCTCATT CAGCTCGAGG TTGTCCTCGA TTGCATAGAA
GAGCATCATT TATTGGACGT CGTGAAATCC GCCGGCGAGA CGCTCTTGAA TGGGTTGCGC
GAGTTACAGG AGAAATATCC GAGCATTCTC GCCAATGCGC GGGGCGTGGG CACGCTTGTC
GCCATCGATT GCGATACATC CGCGCGCAGG GACGCGCTGT TGCACGCACT CTTGCAAAAA
GGTGTCGACA TCGGCGGGTG CGGCTCGGCG ACCATTCGCG CGCGTCCGGG GCTCTTGTTT
ACTTCGGCGC ACGCGGGGGT GTTTTTAGAG CGGTTCGAGC GAGTCCTCGC TGCTGAAATG
TAG
 
Protein sequence
MLARALERCA RRRAREAISR ASRDARASTR ASSSSSRAGT SNGEPAAPVV RTPIPGPASR 
RAVEALSAHA DVGSIRYFVD VDASRGNYVV DADGNAVLDL YAHIASLPVG YNHEKMLAAM
RDEANVGILA HRPALGNNPP IGWDDRVART LMRVAPKGLT RATTMACGAC ANEHAMKAVF
ISAANARRGG REISEEEKVS CLTNQAPGSP GFKVLSFDGA FHGRTAACLS LTHTKWIHKL
DFPTFDWPSC PFPKLKYPLD KFERENAEEE ARCLAEVEKA LTRDRDVVAV IVEPMQAEGG
DNHASADFFR KLRALTKREN VRMIVDEVQT GCGSSGTFWA HEAWGLEHPP DIVTFSKKMQ
IAGFYAAADL APELPYRIFN TWMGDPAKLI QLEVVLDCIE EHHLLDVVKS AGETLLNGLR
ELQEKYPSIL ANARGVGTLV AIDCDTSARR DALLHALLQK GVDIGGCGSA TIRARPGLLF
TSAHAGVFLE RFERVLAAEM