Gene Franean1_0768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0768 
Symbol 
ID5669184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp893601 
End bp894662 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content70% 
IMG OID641239695 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_001505132 
Protein GI158312624 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0510321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0451069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGGGAGA GCGCGATCTG GTGGGCCGTA GAATCACCCG ACATGACGCA GGACCGGCCA 
CGGGTCTCGC TGACCGGCAT CAAGCCGACC GGCGATCCGC ACCTGGGCAA CTACATCGGG
GCGATCCGCC CCGCCCTCGA CCTGGCGGCG ACGTACGAGT CGATCTACTT CATCGCCGAC
TACCACGCCC TGACCTCCAT CCGGGACAGG GCGAAGTTCG CCGCCTACAC CCGGTCCGTC
GCCGCCACCT GGATCACGCT CGGGCTCGAT CCCGAGCGCA CGGTCTTCTA CCGGCAGTCC
GACGTCCCGG AGATCTTCGA GCTGACCTGG ATCCTGTCCT GTGTCACGGG CAAGGGCCTG
ATGAACCGGG CGCACGCCTA CAAGGCGGCG CGGGACCGCA ACGCCGAGAG CGGCGTCGCC
GACCTCGACG CGGGCGTCAA CATGGGGCTG TTCAACTACC CCATCCTGAT GGCCGTCGAC
ATCCTCGTCA TGGGCGCGGA CGTCGTCCCC GTCGGCCAGG ACCAGTCGCA GCACCTCGAG
ATCGCCGCGG ACATCGCCGG CTCGTTCAAC CACCTCTACG GCGACGTGTT CAGCCTGAAG
ATTCCCGAGG CGGTGCTGCC GTCCGGGGCC AACGCGCGGA CGATGCCCGG CACCGACGGC
CGGAAGATGA GCAAGTCGTA CGGGAACACG ATTCCGCTGT TCGCGCCGCC GTCCCAGCTG
CGCAAGCTGG TGCGCGGCAT CCGCAGCGAC AGCACGCCGG TCGAGGCGCC GAAGGATCCG
GACGCCTCCG CCGCCTTCCA GATCTACGAG AACTTCGCGG ACCCGGAGGC CGTCAAGGAC
ATGCGGGTCC GCCTCGAGCA GGGCGGCACC GGCTGGGGCG AGCTGAAGAA CGCCCTGTTC
GAGACGCTCG ACGCCTGGCT GACCCCACTG CGGGCCCGCT ACGACGAGCT GGTGGCCCCG
GGCAGCGAGC TGGACGCGAT CCTCGCCGCC GGCGCGGACA AGGCCCGCGA CCGCGCCCGC
CCCGTCCTGG CCGGCGCCCG CCGCGCGATC GGCGTCGGCT GA
 
Protein sequence
MGESAIWWAV ESPDMTQDRP RVSLTGIKPT GDPHLGNYIG AIRPALDLAA TYESIYFIAD 
YHALTSIRDR AKFAAYTRSV AATWITLGLD PERTVFYRQS DVPEIFELTW ILSCVTGKGL
MNRAHAYKAA RDRNAESGVA DLDAGVNMGL FNYPILMAVD ILVMGADVVP VGQDQSQHLE
IAADIAGSFN HLYGDVFSLK IPEAVLPSGA NARTMPGTDG RKMSKSYGNT IPLFAPPSQL
RKLVRGIRSD STPVEAPKDP DASAAFQIYE NFADPEAVKD MRVRLEQGGT GWGELKNALF
ETLDAWLTPL RARYDELVAP GSELDAILAA GADKARDRAR PVLAGARRAI GVG