Gene Francci3_1991 details

Gene Information

Locus tag: Francci3_1991
Symbol:
ID: 3903699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2340804
End bp: 2341976
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 65%
IMG OID: 637879327
Product: methionine synthase, vitamin-B12 independent
Protein accession: YP_481094
Protein GI: 86740694
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0620] Methionine synthase II (cobalamin-independent)
TIGRFAM ID:
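
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function name `expected_lengths` is illustrative and not part of the database.

```python
def expected_lengths(start_bp, end_bp):
    """Derive gene and protein lengths from inclusive replicon coordinates."""
    # Inclusive coordinates: both the start and the end base are counted.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = expected_lengths(2340804, 2341976)
print(gene_len, "bp;", protein_len, "aa")  # 1173 bp; 390 aa
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.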


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCA GCAGCGACCG CATCCTGACC ACGCACACCG GCAGCCTGCC CCGACCGGCC 
GGGCTGGCCG AGCTGATCCG GGCCCGGGAA CAGGAGACCC TCTCGGTCGC GGACGCCGAG
TACCTGCCCG AGCGGATCGC GGACGCGGTT GGCGTGGTCG TCGGCCATCA GGCGCAGGTC
GGGCTGGACG TGATCAGCGA TGGCGAAATG AGCAAGATCG GGTACGCCAC CTACGTCAAA
GAACGCCTCA CCGGTTTCGA CGTGGACGTT GCCGTTCCCG AGGGCGGCGG CCTGTCGATC
GCTGATCTGG ACGACTACCC TGGCATGGCC GAACGTTCCC TGGCCGGCTT GGAGACCGCG
ACACCGACCT GTACCGGTCC GATCAGCTAC ACCGGCACCG CCTTGCTCGA TACCGATCTG
GCCAACTTCG CAGCCGGCGT CAGCTCAATC TCGGCAGGAT CGGGTCAGCC GACCGAGCGG
TTCATGAATG CCGCGTCACC TGGAGTTATC GCGCTCTATC TTCCGAACCA GTTCTATGCC
AGTTTGGATG AGTACCTGTT CGCGTTGGCC GAAGGAATGA GGGCCGAGTA CGAGGCGATC
ACCGCAGCCG GGCTGGTCCT GCAGATCGAC GCCCCGGATC TGGCGATGGG TCGGCACATC
CAGTACGCGC ACCTGTCCGA GCAGGGATTC CTGGACCGGC TGCGCGTGCA CGTTGAGGCG
ATCAACCACG CGCTACGCAA TATCGACCCG GCGAGGGTGC GGGTGCACCT GTGCTGGGGC
AACTACCAGG GCCCGCACCA CAAGGACGTC GGCCTGGACG TCATCCTGGA CACGATCATT
CAGCTCAAGG CCGATGGGCT GGTATTCGAG GCCGCCAATC ACCGCCACGC ACATGAATGG
CAGGTGCTGG CCGACGCGAA GATTCCCGAG CAGAAGGTCC TCATCCCGGG TGTCATCGAC
ACCTCCAGCG TCTACGTCGA ACACCCCGAA CTCATCGCCC AGCGCATCAC CCGCTTCGCC
GACATCGTCG GCCGCGAGCG CGTCATCCCC GGAACCGACT GCGGCTTCGC GTCCTTCGCC
ACCTTCCTCG CCGTCGACGA GAGCCTGGCC TGGGCGAAAC TCGAATCCCT CACCGCCGGC
GCTCGACTGG CCAGCGATCG ACTGTGGTCA TGA
 
Protein sequence
MKLSSDRILT THTGSLPRPA GLAELIRARE QETLSVADAE YLPERIADAV GVVVGHQAQV 
GLDVISDGEM SKIGYATYVK ERLTGFDVDV AVPEGGGLSI ADLDDYPGMA ERSLAGLETA
TPTCTGPISY TGTALLDTDL ANFAAGVSSI SAGSGQPTER FMNAASPGVI ALYLPNQFYA
SLDEYLFALA EGMRAEYEAI TAAGLVLQID APDLAMGRHI QYAHLSEQGF LDRLRVHVEA
INHALRNIDP ARVRVHLCWG NYQGPHHKDV GLDVILDTII QLKADGLVFE AANHRHAHEW
QVLADAKIPE QKVLIPGVID TSSVYVEHPE LIAQRITRFA DIVGRERVIP GTDCGFASFA
TFLAVDESLA WAKLESLTAG ARLASDRLWS