Gene Jann_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1959 
Symbol 
ID3934410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1952908 
End bp1954530 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content64% 
IMG OID637904313 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_509901 
Protein GI89054450 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00276061 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.544907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAT CCCGCCTCTA TCTTTACGAC ACCACCCTGC GCGACGGGCA GCAGACCCAG 
GGCGTCCAGT TTTCCACGCC CGAGAAGGTC CGCATCGCCG AGGCGCTCGA CGCCCTCGGC
CTCGACTATA TCGAAGGCGG CTGGCCCGGC GCGAACCCCA CCGACAGCGC GTTTTTCGAC
GATGCCCCCA GGACCCGCGC CCGGATGACG GCCTTCGGGA TGACCAAGCG GGCCGGGCGC
TCGGCTGAGA ATGACGACGT TCTGGCCGCC GTCCTGAATG CAGGCACAGG GACCGTCTGC
CTTGTCGGCA AGTCCCATGA TTTTCACGTG ACCGAAGCGC TTGGCATCAC CTTGGCGGAG
AATGTCGAGA ATATCCGCGC CTCCATCGCC CACCTCGTCG CCCAGGGCCG CGAGGCGATC
TTTGACGCGG AGCATTTCTT TGACGGCCAC AAGGCCAACC CGGACTATGC GCTGGAATGC
ATCCATGCCG CCTACAGCGC AGGCGCGCGG TGGGTGGTGC TGTGTGACAC CAACGGCGGG
ACGTTGCCTG ACGAGATTGC CGGGATCACC CACGCCGTCA TCGCCTCAGG TCTCCCCGGG
GAATGTCTGG GCATCCATAC GCATAACGAT ACCGAAACCG CCGTGGCGGG CAGCCTCGCG
GCGGTGCAGG CGGGCGCGCG TCAGATCCAG GGCACGTTGA ATGGTTTGGG GGAGCGCTGC
GGCAACGCCA ATCTGACCAC GCTGATCCCG ACCCTTTTGC TGAAAGAGCC GTACCGGTCG
TCCTATGACA CCGGGATCGC GGTGGAGACG TTGGATGGCC TGACGCGACT GAGCCGGATG
CTGGACGATA TCCTCAACCG GGTGCCGCAC CGACAAGCGC CCTATGTGGG GGCCTCGGCC
TTCGCCCATA AAGCGGGCCT CCACGCCAGC GCCATCGTCA AGAACCCCAC GACCTACGAG
CATATCGCGC CCGAAATCGT CGGCAATGAC CGCATCATCC CGATGTCCAA TCAGGCGGGC
CAGTCGAACC TGCGCAAGCG TCTGGCGGAG GCGGGGTTGG AGGTCGAAAG GGACAACCCG
GCGCTGCCTG CGATCCTCGA TGCGATCAAG GACCGGGAGG CGGACGGGTA CAGCTTTGAC
ACGGCCCAGG CCTCCTTCGA GTTGCTGGCG CGGCGCGCGT TGGGGATGTT GCCCGCGTTC
TTTGAGGTCA AGCGCTACCG CGTCACGGTG GAGCGGCGGA AGAACAAGTA CAATCAGATG
GTGAGCCTTT CGGAGGCGAT GGTCGTCCTG AAAGTGGGCG GCGAAAAGAA GCTGTCGGTC
AGCGAAAGCA TGGACGAGAC GGGCAGCGAC CGGGGGCCGG TGAACGCATT GTCGAAGGCT
CTGGCGAAGG ATCTGGGGCC GTATCAGGAA TTCATCAAGG ATATCCGCCT TGTGGACTTC
AAGGTCCGCA TCACCCAAGG CGGCACAGAG GCCGTCACCC GCGTGATCAT CGACAGCGAG
GACAGTCAGG GGCGGCGCTG GTCGACCGTG GGCGTATCCC CCAACATCGT GGATGCGAGT
TTCGAGGCGC TGCTGGACGC GGTGAACTGG AAACTGATCC ACGAAGGCGC AGCGGTCCTA
TGA
 
Protein sequence
MTKSRLYLYD TTLRDGQQTQ GVQFSTPEKV RIAEALDALG LDYIEGGWPG ANPTDSAFFD 
DAPRTRARMT AFGMTKRAGR SAENDDVLAA VLNAGTGTVC LVGKSHDFHV TEALGITLAE
NVENIRASIA HLVAQGREAI FDAEHFFDGH KANPDYALEC IHAAYSAGAR WVVLCDTNGG
TLPDEIAGIT HAVIASGLPG ECLGIHTHND TETAVAGSLA AVQAGARQIQ GTLNGLGERC
GNANLTTLIP TLLLKEPYRS SYDTGIAVET LDGLTRLSRM LDDILNRVPH RQAPYVGASA
FAHKAGLHAS AIVKNPTTYE HIAPEIVGND RIIPMSNQAG QSNLRKRLAE AGLEVERDNP
ALPAILDAIK DREADGYSFD TAQASFELLA RRALGMLPAF FEVKRYRVTV ERRKNKYNQM
VSLSEAMVVL KVGGEKKLSV SESMDETGSD RGPVNALSKA LAKDLGPYQE FIKDIRLVDF
KVRITQGGTE AVTRVIIDSE DSQGRRWSTV GVSPNIVDAS FEALLDAVNW KLIHEGAAVL