Gene Franean1_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1102 
Symbol 
ID5669516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1315629 
End bp1317302 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content72% 
IMG OID641240034 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001505464 
Protein GI158312956 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.828904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCACG ACACCCCAGC TCCGCTCGGT TCAGCTCCGC TCGGCTCCGC TCACCTCGGT 
GCCGCCGGCG GGCTCCCGTT CGACCGGGAG TCGCTGCACA TCTACGACAC CACGCTGCGC
GACGGCACCC AGCAGGAAGG CCTGTCGCTG TCGGTCGCCG ACAAGCTGGC GGTCGCCCGG
CACCTCGACG ACCTGGGTGT CGGCTTCATC GAGGGCGGCT GGCCCGGCTC GAACCCCAAG
GACGCCGAGT TCTTCCGGCG GGCCCGCACC GAGCTCGACC TCAACGGCGC GCTGCTGACC
GCGTTCGGGT CGACCCGGCG GGCCAGCAAG GCCGTCGCCG ACGACTCCCA GGTCGCCGCG
CTGCGCGACG CCGGCACCTC CGTGGTCTGC CTGGTGGCCA AGGCCGACCG CCGGCACGTC
GAGCGCGCGC TGCGGACCAC GCCCGCCGAG AACCTCGCGA TGATCCGCGA CACCGTCCGT
CACCTGACGA ACGAGGGCAA GCGGGTCTTC GTCGACGCCG AACACTTCTT CGACGGCCAC
CGCGCCGACC CCGCCTACGC GCTCGAGATG GTGCGCACCG CGGCCGAGGC CGGTGCCGAG
GTGATCGTGC TGTGCGACAC CAACGGCGGC ATGCTGCCCA CCCGGATCGG TGACGTCGTG
GCGGCCACGC TCGCGAGCAC CGGAGCGCGC CTGGGTATCC ACACCCACGA CGATGCCGCC
TGCGCCGTCG CGAACAGCCT GGTGGCGATC GAGGCCGGGG CCACCCACGT CCAGGGGACC
GCCAACGGCT ACGGCGAGCG TTGCGGGAAC GCCAACCTGT TCAGCGTCGT CGCGGGCCTG
GAGACCAAGC TGGGCCGCCA GGTCCTGCCG GCCGGGCGGC TGCGTGAGCT CGTGCGTGTC
TCACACGCCA TCGACGAGGT CACCAACTCG GCCCCGAGCA CGCACCGGCC CTACGTCGGC
GCCAGCGCCT TCGCGCACAA GGCCGGCCTG CACGCGAGCG CCGTCAAGGT CGACCCCGAC
ATGTATCAGC ACATCGACCC GGCCGCCGTC GGCAACGACA TGCGGATGCT CGTCTCCGAA
CTGGCCGGCC GCGCGACCCT CGAGCTCAAG GGCCGCGAGC TCGGCATGGA CCTCTCCGGG
GAGCGTGAGG CGCTCGGTCG GGTGCTGGAG ATGGTCAAGG ACAGGGAGGC CTCCGGCTAC
GCCTACGAGG CCGCCGAGGC GTCCTTCGAG CTCATGCTGC TGGACGAGGT CTCGGGCCGG
GAGCGGTTCT TCACCCTGGA ATCCTGGCGG GTCATCGTCG AGCAGCGCTC CGGCGGCGAG
GTCGTCAGCG AGGCCACGGT GAAGCTCACC TCCCACGGCG AGCGGCACGT GTCGACGGCG
GAGGGCAACG GGCCCGTCAA CGCGCTCGAC ACCGCCCTGC GCAAGGCGCT GGAGAAGGCC
TACCCGGGCC TGGCCGATCT CGACCTGGTC GACTACAAGG TCCGCATCCT CGACGGCCGG
CAGGGCACCG GTGCGGTCAC CCGCGTCCTG GTGGAGACCA GCGACGGCCG CGGCCGCTGG
GACACCATCG GCGTCGACGA GAACATCATC GCCGCCTCCT GGGTGGCGCT GCAGGACGCC
GTCACCTACG GCCTACGCCG CCAGGGTGAG CGCCCCGACC CGGACGCCGT CTGA
 
Protein sequence
MVHDTPAPLG SAPLGSAHLG AAGGLPFDRE SLHIYDTTLR DGTQQEGLSL SVADKLAVAR 
HLDDLGVGFI EGGWPGSNPK DAEFFRRART ELDLNGALLT AFGSTRRASK AVADDSQVAA
LRDAGTSVVC LVAKADRRHV ERALRTTPAE NLAMIRDTVR HLTNEGKRVF VDAEHFFDGH
RADPAYALEM VRTAAEAGAE VIVLCDTNGG MLPTRIGDVV AATLASTGAR LGIHTHDDAA
CAVANSLVAI EAGATHVQGT ANGYGERCGN ANLFSVVAGL ETKLGRQVLP AGRLRELVRV
SHAIDEVTNS APSTHRPYVG ASAFAHKAGL HASAVKVDPD MYQHIDPAAV GNDMRMLVSE
LAGRATLELK GRELGMDLSG EREALGRVLE MVKDREASGY AYEAAEASFE LMLLDEVSGR
ERFFTLESWR VIVEQRSGGE VVSEATVKLT SHGERHVSTA EGNGPVNALD TALRKALEKA
YPGLADLDLV DYKVRILDGR QGTGAVTRVL VETSDGRGRW DTIGVDENII AASWVALQDA
VTYGLRRQGE RPDPDAV