Gene Clim_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0159 
Symbol 
ID6356129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp177706 
End bp178806 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content53% 
IMG OID642667786 
Product1-alkyl-2-acetylglycerophosphocholine esterase 
Protein accessionYP_001942237 
Protein GI189345708 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00519] L-asparaginases, type I 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGAA ACATGAAGAT AGCGGTACTC TATACCGGCG GAACCATCGG GTGTGAGGGT 
ATCCCGCTTG CTCCGGTGAG CGGTGAGCTT TTCAGAAAGA GAGTTTACGC TCTGCCCTGT
TTCAGGAACG GCAGCATCCG TTGTTCGGAG GGTGATATTG TGTTCAGCAT CGAATGGACG
GCAAATCCTG TGGACAGCTC GGATCTTATG CCGCTTGATT GGGTGGCAAT GGCTTCATGG
GTGCTTGAGC ATTATGCGTG TTATGATGGT TTTGTGATTC TGCATGGAAC CGATACCATG
TCGTGGAGTG CTTCGGCCCT CTCTTTTCTG CTTCAAGGGC TTTCCAAACC GGTTGTATTC
ACCGGATCGC AACTGCCGCT TGCATCCGGG CGAACCGATG CCGTGCAGAA TCTCCTTACA
GCGATCATGT TTGCGGCGAA TTTTCGAATT CCCGAAGTAA CTCTTTTTTT CGATCATCTT
CTTCTGAGAG GAAACCGTTC TGTCAAGGTG GATTCCCGTT CGTTCAATGC CTTTCTTTCG
CCGAACTATC CGGTACTCGG CAGTGCCGGA ACGGATATGA CCGTGAACCA CAGGGTTCTG
CTTGATCCTC CGGGGGGTTC TGTTTCATTG GACGAGCAAC GAAACCACGC TCTGCGGAGC
CGAGAGATTA CGGAGCTGAG TCGGGTACTG CCGGAGTATT CTGTGATTGC CCTGACGCTT
TTTCCGGGTA TTCAGGCCGG CATGGTGGAT GCTCTGCTTA CTCTTTCGCC TTCGTTGAAA
GGTATCGTGC TGAAATCGTT CGGTTCGGGC AATGCACCGG CTTCAAGCGG GTTTATCGAC
GCACTTGCAA GGGCTGCCGA TAAGGGTGTT GTGATTGTCG ATGCGACCCA GGTACTCTCA
GGCCGGGTAG AAATGAAGCG GTATGAAACC GGTTATCAAC TGCAACGAAA GGTGCATGCG
GTATGCGGAC ATGATCTTAC GGCTGAAGCG ACGCTTGCAA AGCTGATCTG TCTGACAGGG
AGAGCCATGA TCGATGGACA TGGCCGTGAA TCGGTTGAAC AGGGAATCGA AACCGTGCTC
TGCGGAGAGA TGACCCTGTA G
 
Protein sequence
MSGNMKIAVL YTGGTIGCEG IPLAPVSGEL FRKRVYALPC FRNGSIRCSE GDIVFSIEWT 
ANPVDSSDLM PLDWVAMASW VLEHYACYDG FVILHGTDTM SWSASALSFL LQGLSKPVVF
TGSQLPLASG RTDAVQNLLT AIMFAANFRI PEVTLFFDHL LLRGNRSVKV DSRSFNAFLS
PNYPVLGSAG TDMTVNHRVL LDPPGGSVSL DEQRNHALRS REITELSRVL PEYSVIALTL
FPGIQAGMVD ALLTLSPSLK GIVLKSFGSG NAPASSGFID ALARAADKGV VIVDATQVLS
GRVEMKRYET GYQLQRKVHA VCGHDLTAEA TLAKLICLTG RAMIDGHGRE SVEQGIETVL
CGEMTL