Gene Franean1_1855 details


Gene Information

Locus tag: Franean1_1855
Symbol:
ID: 5670257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2225925
End bp: 2228408
Gene Length: 2484 bp
Protein Length: 827 aa
Translation table: 11
GC content: 75%
IMG OID: 641240776
Product: hypothetical protein
Protein accession: YP_001506199
Protein GI: 158313691
COG category:
COG ID:
TIGRFAM ID:
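
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2225925
END_BP = 2228408


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2484, matching "Gene Length"
print(protein_length(length_bp))  # 827, matching "Protein Length"
```
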


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.152269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00336297
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGATCCACA TACACAATCG GGGGCCCAAA GGGGACCATT TGGACAATTC CGCGGAACGA 
TCGTGGACCA TGGGTGCCGT GACGGCACCT GGACAGCTCC ACGCCTCCGG ACCCCGCGGC
GCACTGGAGG CCCCTGGCCG ATCCCGACCC GGCGACGGCG ACGCCTGGAC CACGCCCGCC
GTGAGCCCGG CCACCGGCGA CGACGAGAGC GGCTCCGGCC ACCGCGCCGC CCTGGGGCCG
GCCGCGGGCC CGTCCGAGCT CGAGCCCGGC TTCGAGGCCG AGCCCCGCCC CGGGCTGAAC
CGGCGCGGGC GCTTCCGCCG GCGGTCAGTT ACCGGACGGC GCTCCCCGCT CACCCGGCAC
TCCGTGTTCG GAGTTTTCCT GCTCGCCGGC GTCGTGCTGC GGCTCGTCAC CACCTACGCC
TACCGGCCCG TCTTCGAGTT CAACGGCGAC TCGTACGCGT ACATCCGGCT TACCCGGCTG
TCAGAACCGG ACCCGATGCG CCCGGCCGGC TACCCGGCGT TCCTGCGCGT ACTGACCGAG
ACCGGAGCCG ACCTGTGGGT CGTCCCCCTG GTACAGCACG TGCTCGGGAT CCTGCTCGCG
ACGGCCCTCT ACGTGCTGCT CCTCCACCGC AGGGTGGCCC CGCCGATCGC CGCGCTCGCC
ACCGCACCGA TCCTGCTCGA CGCCTACCAG ATCGTGATCG AGCACTTCGT GATGGCCGAG
ACGCTCTTCG CCGTCCTCCT CGTCGCCGCG GTCGTCGCCC TGATGTGGTC ACGCCGGCCG
TCTGTATGGG CGTTCGCCCT GGCCGGCCTG CTCCTCGGCG CGTCGGGCCT GGTGCGCACG
ATCGGCGTGG CTATCGGCCT GCTGGCCTTC GGCTACGTGG TGCTGCGCCG GGTCGGCTGG
CTGCGGGTCG GGGTCTTCGC CGTCTTCCTG GCCGCGCCGC TGATCGCGTA CGCGTCCTGG
TTCCAGTCCG CGCACGGGAA GATCGGGCTC ACCGGGGGCG ACGCCGCCTG GCTGTACGGC
CGGGTCGCCC CGATCGCCGA CTGCGGCCAG CTCGACCTCG AGCCCAGCCA GCTCTCGCTG
TGCTCCCCGC ACCCGGTCGG CGAGCGGCCC GACCCGAGCT ACTACGTGTG GGACCGCAAC
AGCCCGAGCA ATCAGCTCGA CGTCCCCGTC GACGAACGCG ACCGGCTGCT GAACGACTTC
TCGCGGCAGG TCATCACCCG GCAGCCCGTT GACTACCTGC GCATGGTCGG CTCCGACATC
GCGCACTACT TCGAGCCGGG ACGCCGCGTC GGGCCCCGGG ACTGGCCGGA CGCCACCTGG
CGCTTCCCGA CCGCGGACGA GCCCCGCTAC CTGCACAACG ACGAGCCGCT GCTGGGCCTG
CACGGGGAGG CCGACCGGCC GGATCGGACC GTGATCGAGC CGTGGGCCGA CTATCTGCGG
GCCTACCAGA GCCGGGGTTT CACCCCCGGC CCGGCGCTCG CCGTGGCCGG TGTCCTCGGC
CTGCTCTCCT GCCTGGCCGC GCTGCCGCGG GTCGTCCCGG CGGGCCTGCG CGGGGGTGAC
GGGCGTGCCC GCTGGCACGA CCTGACCGCG GAACGCCGGC GCACCGGCGC CGACTGCCTG
TTCCTGGTCG CGACCGGGGC AACGATGATC ATCGTGCCGG CGGCCACCGT CTGCTTCGAC
TACCGGTATC TGCTGCCAGC GCTGTTCCTG CTGCCGCCGG CAGCCGCGCT CGCCGTCCAC
CAGGGCCACC TGCTGGTCGT CGCGTGGCGT GAGCGGCGCG AGGCCGCCGA GACTCCCTGG
CGTACGCCTC CCGGCCCGAC CGGCCTCGGC GCGGCCGACC CTGACACCGA CGGTGACCCG
ACCGGTGCTG ACACGCTCAG CGCCACCGAG ACCGGCCCTG ACGGCGGCGG CGCGACCGGT
TTCGGCGCGC CAGGCGATCG CGCCGATCCG GACGCCCCGT TCAGCCTCGG CGGTCTCGGC
GCGCCGCCGG GCCGCGTGGC CCCGGCAGGC CACGGCGATC ATCGCGAAGG CAGGCCCACG
AGGCCGGCCA CGCCGGCCAG ACCCATCCCG CCGACGACAC CGAACCCGCC GGCGCCGATT
CCGCCGACGG ATCCGATTCC GTCAGCGGCG TCGGCCAGGC CGGCCCCTCT CCGGAGCGGA
CCGAACCCGG GCGCGGCGGC GGCACGGCGG CAGAACCCGT CCGGCACACC GCCGCTGCCG
AAACGTGCGC CCGGGGTCAC GCTCGAGGCG AGGAACCGCC GATCGCGTCC CGCCGCAGCC
GGCGGGGCCG CCCCTGAGAC GCCGTCCGGA CGAAGCACGC CGCAACGCCG GCCCGTGAGC
CTCGGCCCGG ACGCCCGGAT GCCCACCCCG GGCAGCCGAC CCGCGGCGCG CGGCCCGGCC
GCGGCGAACG CCGGGGACGA CCCGACGGCG CCGCCCACGA AGGTCGAGCC CGACGCCGGT
GACGATCCGA CGTTCCCCGG TTGA
 
Protein sequence
MIHIHNRGPK GDHLDNSAER SWTMGAVTAP GQLHASGPRG ALEAPGRSRP GDGDAWTTPA 
VSPATGDDES GSGHRAALGP AAGPSELEPG FEAEPRPGLN RRGRFRRRSV TGRRSPLTRH
SVFGVFLLAG VVLRLVTTYA YRPVFEFNGD SYAYIRLTRL SEPDPMRPAG YPAFLRVLTE
TGADLWVVPL VQHVLGILLA TALYVLLLHR RVAPPIAALA TAPILLDAYQ IVIEHFVMAE
TLFAVLLVAA VVALMWSRRP SVWAFALAGL LLGASGLVRT IGVAIGLLAF GYVVLRRVGW
LRVGVFAVFL AAPLIAYASW FQSAHGKIGL TGGDAAWLYG RVAPIADCGQ LDLEPSQLSL
CSPHPVGERP DPSYYVWDRN SPSNQLDVPV DERDRLLNDF SRQVITRQPV DYLRMVGSDI
AHYFEPGRRV GPRDWPDATW RFPTADEPRY LHNDEPLLGL HGEADRPDRT VIEPWADYLR
AYQSRGFTPG PALAVAGVLG LLSCLAALPR VVPAGLRGGD GRARWHDLTA ERRRTGADCL
FLVATGATMI IVPAATVCFD YRYLLPALFL LPPAAALAVH QGHLLVVAWR ERREAAETPW
RTPPGPTGLG AADPDTDGDP TGADTLSATE TGPDGGGATG FGAPGDRADP DAPFSLGGLG
APPGRVAPAG HGDHREGRPT RPATPARPIP PTTPNPPAPI PPTDPIPSAA SARPAPLRSG
PNPGAAAARR QNPSGTPPLP KRAPGVTLEA RNRRSRPAAA GGAAPETPSG RSTPQRRPVS
LGPDARMPTP GSRPAARGPA AANAGDDPTA PPTKVEPDAG DDPTFPG