Gene Clim_1558 details

Gene Information

Locus tag: Clim_1558
Symbol:
ID: 6354206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1677509
End bp: 1678669
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 53%
IMG OID: 642669164
Product: aminotransferase class I and II
Protein accession: YP_001943586
Protein GI: 189347057
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
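
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another, and this can be checked with simple arithmetic. The Python sketch below is only an illustration: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information record for Clim_1558.
length = gene_length_bp(1677509, 1678669)
print(length, protein_length_aa(length))  # 1161 386
```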


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0317788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCTTT CTCTGGGCAT CCGGCACGGG TGCGTGATGC AATCGGAAAT TCGTGCCATG 
TCGATAGCGT GTGCAAAGGC CGGAGGAATC AATCTTTCTC AGGGAGTGTG TGATACGCCT
GTCCCTGAAG CGATATCCGG CAGCGTTGCT TCGGCCATTG AACAGGGATT CAATACATAT
TCGCATTATG CGGGGCTGAA AATCCTGCGG GACTCTGTTT GCGGAAAGCA GAAGCGGTTT
ACTGGTCTTG AATTCGATCC TGAAAGCGAG GTTATCGTCA GTGCGGGAGC AACAGGAGCC
ATGTATTGCG CTTTTCAGGC CCTGCTCAGT CCCGGAGATG AGGTTATCGT TTTCGAGCCA
TTTTACGGTT ACCATATCAG CACCCTTCTT ACAGCAGAAG CCGTTCCGGT TTTTGTGCCG
CTTTCACTGC CGGGCTGGAT CTTCATGCTT CATGATCTGG AGCATGCCGT GACGTCCAGG
ACGAAGGGCA TCATCGTCAA TACACCCTCC AATCCTTCAG GAAAAGTTTT CAGCCGCGAG
GAACTCGAGG TGATTGCCTC GTTTGCGGAG CGATATGATC TTTTCGTCTT TACCGATGAG
ATCTACGAGC ATTTCATTTA TGAAGGCAAC CATACCTCCT TTGCCACACT GCCAGGCATG
AAATCCCGAA CGATCACCGT GTCAGGATTC TCGAAAACCT TCAGCATAAC CGGCTGGCGT
CTTGGTTATG CGCTTTGCGA CGCACGTTGG GCTCAGGCAA TAGGGTATTT CAACGATCTT
GTCTATGTCT GTGCACCGGC ACCGCTGCAG GCAGGTGTTG CCGAAGGCAT GAGGCGGCTC
GATGACGGAT ATTATCGGAG GCTTTCCTTT GAATACAGCG AGAAACGGGA GCGTTTCTGC
GACGCGCTTT CTGTTGCGGG ACTAACGCCT CATGTTCCCG GAGGGGCATA CTATGTACTT
GCCGATGTCG GGCATCTGCC CGGCAGTTCT GCTGCGGAAA GAGCTCACTA TATTCTTGAA
AAAACCGGGG TGGCCTGCGT ACCTGGAAGT GCATTTTTCA GCTGCGGCAG GGGAGAGGAT
CTGGTGCGTT TCTGCTTTGC CAAAGACGAT ACGGTTCTTG ACGAAGCCTG CCGGCGTCTT
GCATCGATTC GAAATACGTA A
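
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is analysed. The Python sketch below shows one way to do that and to compute a GC percentage like the GC content value in the record. The helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a demonstration.
first_blocks = clean_sequence("ATGGGCCTTT CTCTGGGCAT")
print(len(first_blocks), gc_percent(first_blocks))  # 20 55.0
```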
 
Protein sequence
MGLSLGIRHG CVMQSEIRAM SIACAKAGGI NLSQGVCDTP VPEAISGSVA SAIEQGFNTY 
SHYAGLKILR DSVCGKQKRF TGLEFDPESE VIVSAGATGA MYCAFQALLS PGDEVIVFEP
FYGYHISTLL TAEAVPVFVP LSLPGWIFML HDLEHAVTSR TKGIIVNTPS NPSGKVFSRE
ELEVIASFAE RYDLFVFTDE IYEHFIYEGN HTSFATLPGM KSRTITVSGF SKTFSITGWR
LGYALCDARW AQAIGYFNDL VYVCAPAPLQ AGVAEGMRRL DDGYYRRLSF EYSEKRERFC
DALSVAGLTP HVPGGAYYVL ADVGHLPGSS AAERAHYILE KTGVACVPGS AFFSCGRGED
LVRFCFAKDD TVLDEACRRL ASIRNT
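
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first and last blocks of the gene. It is an assumption-laden illustration, not the database's own method: the names are hypothetical, and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene give the first ten residues of the protein.
print(translate("ATGGGCCTTT CTCTGGGCAT CCGGCACGGG"))  # MGLSLGIRHG
# Final 21 bases of the gene give the last six residues, then the TAA stop.
print(translate("GCATCGATTC GAAATACGTA A"))  # ASIRNT
```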