Gene Franean1_1694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1694 
SymbolalaS 
ID5670096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2025168 
End bp2027846 
Gene Length2679 bp 
Protein Length892 aa 
Translation table11 
GC content72% 
IMG OID641240612 
Productalanyl-tRNA synthetase 
Protein accessionYP_001506038 
Protein GI158313530 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0569274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.130258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTTATC CCGACGACCC TGGACGACGT GAACCGATGG ACACGGCCGA GATCCGCCGC 
CGCTTTCTGA ACCATTTCTC CGAGCGGGGT CACACCGTGG TCCCGAGTGC GTCCCTGGTG
GCGCAGGACC CGACCCTGCT GCTGGTGAAC GCCGGCATGG TTCCCTTCAA GCCCTACTTC
CTCGGGGACC TGAAGGCGCC GTGGAACCGT GCGACCAGCG TGCAGAAATG CGTGCGGACC
GTGGACATCG ACAACGTCGG CCGCACCGCC CGACACGCCT CCTTCTTCCA GATGTGCGGG
AACTTCTCCT TCGGTGACTA CTTCAAGGCC GAGGCGATCC CGTTCGCCTT CGAGCTGATC
GTCGACGGCT ACGGCTTCAA CCCCGACGAC CTGTGGGCCA CCGTCTACCT GGACGACGAC
GAGGCCGAGG CGATTTGGCG CACCCTGCTG CCCGCCGAGC GCATCCAGCG GCGCGGCAAG
AAGGACAACT TCTGGTCGAT GGGTGTGCCC GGCCCGTGCG GCCCGTGCAG CGAGATCTAC
TTCGACCGCG GCCCGGCGTA CGGGCGCGAG GGCGGCCCGG AGGCGGACGA GGACCGCTAC
CTGGAGATCT GGAACCTCGT CTTCATGCAG TTCGCGCGGG GCGAGGGCAG CGAGTACGGC
TACGAGATCG TCGGTGATCT GCCCGCCCGC AACATCGACA CCGGGATGGG CCTGGAGCGG
ATGGCCACCA TCCTGCAGGG TGTCGAGAAC CTGTACGAGA TCGACATCTC CCGTCCGGTG
CTCGACGCGG CCGGCCGGCT CACCGGCACC CGCTACGGCG CCGACCCGGA TTCCGACGTC
CGGCTGCGGG TCGTCGCCGA CCACACCCGG ACCGCGGCGA TGCTGATCTC GGACGGGGTC
TCGCCGTCCA ACGAGGGCCG CGGGTACGTC CTGCGGCGGA TGCTGCGCCG GGCGGTGCGT
GACGCCCGGC TGCTGGGCGC CCGTGAGCCG GTCATGGACG AGCTGTTCGG CGTGGTCCGC
GCGGCGATGG GCCCGATCTA CCCGGAGCTC GTCGACCAGG CCGAGGCGAT CACGGCGGTC
GCGGTCGCCG AGGAGACGGC TTTCCTGGAG ACGCTGCGCA CGGGCACCAC CCTCTTCGAC
ACCGCGGTCA CCCAGGCCCG GTCCAGCGGG TCGTCCCAGC TCAGCGGCGA GTCGGCGTTC
CGGCTGCACG ACACCTACGG GTTCCCGATC GACCTGACCA TGGACATGGC GGCCGACGCG
GGCCTGACCG TGGACGAGGC CGGCTTCCGC CGGCTGATGG AGCGCCAGCG CCAGGCGGCG
AAGGCCGACC GGGCGTCCCG CCGCATCGGC AACCTGGACC TCTCCGCCTT CCGGCCGATC
CTCGCCGCCT CCGGCCCGAC GACGTTCACC GGCTACACCG AGCTCGGGCG CGAGTCGGGC
ATCGTCGGCA TCGTCGGCAT CGGCGACGGC GACAGCCTGA CCGCGGCCGG CGAGGGGGAG
GAGGTCGGCA TCCTGCTCGA CGCGACCCCC TTCTACGCCG AGAGCGGTGG CCAGGAGGCC
GACCTGGGCC GGATCCGGTT CGACGGCGGC GAGGCCGAGG TGCTCGACGT CCAGCGCCCG
GTGCCCGACC TGGTCATGCA CCGGGTGAAG GTGCTCGGTG GCGAGCTGCG TGTCGGCGCG
GACGTGTTCG CCGAGGTGGA CGTCGAGCGC CGGCGCGCGG TGTCGCGCTC GCACACCGCC
ACCCACCTCG TGCACACCGC GTTCCGCCGG GCGCTCGGGG AGTCGGCGAC GCAGGCCGGG
TCGCTGAACT CGCCGGGCCG GCTGCGCTTC GACTTCCACG CGCTCGGCGC GGTGCCCGAC
TCCGTCCTCG CCGACGTCGA GGACGAGGTC AACGAGATCG CCCTGCGTGA TCTGGAGGTC
CGCTGGTACG TCACCTCTCA GGAGGAGGCG CGCCGGCTGG GCGCGATGGC GCTGTTCGGC
GAGAAGTACG GCGACCGGGT CCGTGTCGTG GACGTCGGGG ACTACGCCCG CGAGCTGTGC
GGTGGTACCC ATGTGGCCAG CTCGGCCCAG CTTGGCGCGA TCAAGCTGCT GTCCGAGTCG
TCGATCTCGG CCGGGACGCG CCGGGTGGAG GCGCTGGTCG GCATGGATGC CTTCCGGTTC
CTGGCCCGCG AGCACGTGCT CGTCTCGCAG CTCTCCAGCA CGCTCAAGGC CCGTCCGGAC
GAGCTCGCCG ACCGGGTCGC CGACATCGTC GGGCGGCTGC GAGACGCGGA GAAGGAGCTG
GAGCGGCTGC GGGCACAGGC GGTGCTGGCC GGCTCGGCGG CGCTCGCCGC CGGCGCCGAG
GACGTGGGGG GCGTGGCGCT GGTCACCGCG CAGGTGCCCG CGGGCACTCC GGCAGACGAC
GTCCGCCTGC TCGCCCTGGA TGTGCGCGGC CGGCTCGCCG GCCGGCCGGC GGTGGTCGCG
GTCGTCGAGG CCGCCGGCGC GGCAGTCGTC GTGGCGACCG ACGAGACCGC GCGGACCCGC
GGCCTGCGGG CCGGCGACCT GGTCCGGCAC TCCTGGGCCG CGCTCGGAGG CAAGGGCGGC
GGCAAGCCTG ACGTCGCCCA GGGCGGACGC GGTGACGCGG ACATGATCCC GAAGGTCTTC
GCCCGGCTGC GCGAGCTGGT CGCCGACCAG AGCGCGTGA
 
Protein sequence
MPYPDDPGRR EPMDTAEIRR RFLNHFSERG HTVVPSASLV AQDPTLLLVN AGMVPFKPYF 
LGDLKAPWNR ATSVQKCVRT VDIDNVGRTA RHASFFQMCG NFSFGDYFKA EAIPFAFELI
VDGYGFNPDD LWATVYLDDD EAEAIWRTLL PAERIQRRGK KDNFWSMGVP GPCGPCSEIY
FDRGPAYGRE GGPEADEDRY LEIWNLVFMQ FARGEGSEYG YEIVGDLPAR NIDTGMGLER
MATILQGVEN LYEIDISRPV LDAAGRLTGT RYGADPDSDV RLRVVADHTR TAAMLISDGV
SPSNEGRGYV LRRMLRRAVR DARLLGAREP VMDELFGVVR AAMGPIYPEL VDQAEAITAV
AVAEETAFLE TLRTGTTLFD TAVTQARSSG SSQLSGESAF RLHDTYGFPI DLTMDMAADA
GLTVDEAGFR RLMERQRQAA KADRASRRIG NLDLSAFRPI LAASGPTTFT GYTELGRESG
IVGIVGIGDG DSLTAAGEGE EVGILLDATP FYAESGGQEA DLGRIRFDGG EAEVLDVQRP
VPDLVMHRVK VLGGELRVGA DVFAEVDVER RRAVSRSHTA THLVHTAFRR ALGESATQAG
SLNSPGRLRF DFHALGAVPD SVLADVEDEV NEIALRDLEV RWYVTSQEEA RRLGAMALFG
EKYGDRVRVV DVGDYARELC GGTHVASSAQ LGAIKLLSES SISAGTRRVE ALVGMDAFRF
LAREHVLVSQ LSSTLKARPD ELADRVADIV GRLRDAEKEL ERLRAQAVLA GSAALAAGAE
DVGGVALVTA QVPAGTPADD VRLLALDVRG RLAGRPAVVA VVEAAGAAVV VATDETARTR
GLRAGDLVRH SWAALGGKGG GKPDVAQGGR GDADMIPKVF ARLRELVADQ SA