Gene Francci3_4267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4267 
Symbol 
ID3907234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5090035 
End bp5091441 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content71% 
IMG OID637881593 
Productprolyl-tRNA synthetase 
Protein accessionYP_483342 
Protein GI86742942 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00408] prolyl-tRNA synthetase, family I 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.975115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGTAC TCACCTCCCG CTCGACCGAC TTTCCGCGCT GGTATCAGGA CGTGCTCGCC 
AAGGCCGAGC TGGCCGACAA CGGCCCCGTC CGCGGGACGA TGGTCATCCG ACCGTACGGC
TACGCGATCT GGGAACGCAT GCAGGCCGAG GTGGACTCCC GAATCAAGGC CGCCGGGGCC
GTCAATGCCT ACTTCCCCCT GTTCATCCCC GAAAGCTACC TGCGCCGGGA GGCCGAGCAC
GTCGAGGGCT TCAGCCCGGA GCTGGCGGTG GTCACCATCG GTGGCGGCAA GGAGCTGGAG
GAGCCCGTAG TCGTCCGGCC CACCAGCGAG ACCGTGATCG GCGAATACCT GGCGAAGTGG
ACCCAGAGCT ACCGTGACCT GCCCCTGCTG CTCAACCAGT GGGCGAACGT GGTCCGGTGG
GAGCTGCGTC CCCGGCTGTT CCTGCGCAGC AGCGAGTTCC TCTGGCAGGA GGGCCACACC
GCGCACGCCG ACGAGGCCGA TGCCGCGGCC TACGCCCGTC GGATCGCGCT CGAGGTCTAC
CGCGACTTTA TGACGCAGGT GCTGGCGGTC CCGGTGTTCG TCGGAGTGAA GACGCGCCGG
GAACGGTTCG CCGGCGCGAC CAACACCATG ACCTGCGAGG GCATGATGGG CGACGGCAAG
GCTCTGCAGA TGGCGACCAG TCACGAGCTC GGCCAGAACT TCGCCCGTGC CTTCGACATC
GACTTCCTCG GCGCCGACGG AGCCCGGCAT CTGGCGTGGA CGACGTCGTG GGGCTGCTCG
ACCCGGATGG TCGGCGGGCT GATCATGGCA CATGGCGACG ACAACGGCCT GCGTGTCCCG
CCCCGGTTGG CGCCGACGCA GGTCGTGGTC CTGCCGGTGC GCGACGAGGA GACCGTCGTC
GCGAAGGCCC GCCAGATCGC CGCCGCCCTG ACCGACGCCG GTCTTCGGGT GCAGGTCGAC
GCCCGTCCCG GGTTGTCCTT CGGCCGGCGG GTCACCGACG CGGAGATCAA GGGCATCCCG
GTACGGGTTG AGGTGGGTCC GCGGGACCTG GCCGCGGGCA ACGTCACCCT GGTGCGCCGG
GACACCTCCG AGAAGGTGCC GGTGCCGCTG GCCGAGGTCG CCACGCGGGT GCCGGTGCTG
CTGGGCGAGG TGCAGGCCGA CCTGTACGCC GAGGCGCTGG CCCTACGCGA GAGCCGGACG
ACGGACGTCG CCACCGTTGC CGAGGCCGCC CGGGCCGCCC AGGCCGGCTT CGCCCGGATC
CCCTGGCGCC TTGTCGGCGA GGAGGGCGAG GCCGAGCTCG CCGAGGAGGC GCTCACCGTG
CGGTGCATCC AGACACCGGA CGGCGGGATC CCCGAGGCCG GCAGCGACGC CGACGACCTC
GTCTGCCTGA TCGCCCGCTC CTACTGA
 
Protein sequence
MAVLTSRSTD FPRWYQDVLA KAELADNGPV RGTMVIRPYG YAIWERMQAE VDSRIKAAGA 
VNAYFPLFIP ESYLRREAEH VEGFSPELAV VTIGGGKELE EPVVVRPTSE TVIGEYLAKW
TQSYRDLPLL LNQWANVVRW ELRPRLFLRS SEFLWQEGHT AHADEADAAA YARRIALEVY
RDFMTQVLAV PVFVGVKTRR ERFAGATNTM TCEGMMGDGK ALQMATSHEL GQNFARAFDI
DFLGADGARH LAWTTSWGCS TRMVGGLIMA HGDDNGLRVP PRLAPTQVVV LPVRDEETVV
AKARQIAAAL TDAGLRVQVD ARPGLSFGRR VTDAEIKGIP VRVEVGPRDL AAGNVTLVRR
DTSEKVPVPL AEVATRVPVL LGEVQADLYA EALALRESRT TDVATVAEAA RAAQAGFARI
PWRLVGEEGE AELAEEALTV RCIQTPDGGI PEAGSDADDL VCLIARSY