Gene Franean1_5873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5873 
Symbol 
ID5674196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7128302 
End bp7129561 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content71% 
IMG OID641244723 
Producthypothetical protein 
Protein accessionYP_001510125 
Protein GI158317617 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0436897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.694772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGAA ACAGCGCACC CACGCCGAAC GGTGCCGAGC CCGACCCGTT CGCCGACCCG 
TTCGCCGATC TGGTCCTTGA CGAGGCCTTC ATCGCCGGAG CCACCCGATA CGAGGCCCCG
GCCCGGACGA GAGCCGCGGT CGCCCGCTTC GGGCCGCTCG AAGACGGTTC AACGCCCTGG
CGCTCGTACG GCGGCGGCGG CCGGCCACGG CGGGCGGGAG GTGCCTCCGG ACCACGGCTG
GGCCGCGGAC GGGCGTCCCA GTCGATTCCG GAGATCTCGC CGCGGCCGCG GTCCCGGGGG
CGGATCGTGC TGGCACTGAT CAGCGTCGTG CTGCTCTCGA CCACCGTCTA CGGGTTCGTG
GCCTCGACCG TCGACAGTCC GCCGTCCACC ACGCTCTCGG CGGACCCGCC GACCCCGGCA
GGGAGTGCCT CCGGTCGCGA CGCCATCGCC GAGGAATCGC TACCGGATCT CTACCGGCGG
GACTGGACGG CCGGCCACTG CTACACCTGG CTCCAGCGGG ACGCCTCGAC CGCGGTCAAC
GACGTCCCCT GCGCCGGCCC GCACCTGTTC GAAGCGGTCG GGCCGCTCGA CATCGGCTCC
GCCTATCCTG AGGGGACGAC ATACCCGTCG CCCGCCGAGT GGGTGGCGCT CGGCAGGCGG
CAGTGCGAGC CGCTCATCAC CGCGTATCTC GGCTACGGGC TCGACCCGTT CGGGCGCTTC
GGCACGAGCG TCCTCCACTC CAAGGAAGCG GAGTGGAACA CCGGCGAGCG GGACATCGTC
TGCGGCCTGT CACTCCATCC CTCGCCCACG GCACCGTACG AGGTGCCCGA CCTCGAAGGA
CAGGTCCGCG GAGCCGACCA GGCGCTGACC TATCCCACCG GGACCTGCTT CCGTACGAAC
ACCGAGGGCC GCAACGAGGT GGTCCCCTGC GAGCAGAGCC ACCATTCGCA GAGCGTGGGC
ACGGCGACCC TGGCGGACAC CGCCGAGGGC GCACCGAGCA GTGCGCCCCT CTCGGCGGAG
CGGCTCGGCG AACTCATCGA CGCGGCCTGC GCACCCCGCA TCGCGCCCTA TCTCGACCGA
GGATTCGGGG GCGCATCCGT ACAGGGCGGA TCGCATTCCC TCCCCCCGGA AAGCTGGTGG
GCGGGAACCC GATCCACCAC CTGCACCGTG AGCCTCGTCG ACCGGAACGG CAGAAAACTG
GAGACGACCG GCCTACTCAG CCCGGACGAT GAATCGGGGT CGTTCAGCGC GAACGTGTGA
 
Protein sequence
MSGNSAPTPN GAEPDPFADP FADLVLDEAF IAGATRYEAP ARTRAAVARF GPLEDGSTPW 
RSYGGGGRPR RAGGASGPRL GRGRASQSIP EISPRPRSRG RIVLALISVV LLSTTVYGFV
ASTVDSPPST TLSADPPTPA GSASGRDAIA EESLPDLYRR DWTAGHCYTW LQRDASTAVN
DVPCAGPHLF EAVGPLDIGS AYPEGTTYPS PAEWVALGRR QCEPLITAYL GYGLDPFGRF
GTSVLHSKEA EWNTGERDIV CGLSLHPSPT APYEVPDLEG QVRGADQALT YPTGTCFRTN
TEGRNEVVPC EQSHHSQSVG TATLADTAEG APSSAPLSAE RLGELIDAAC APRIAPYLDR
GFGGASVQGG SHSLPPESWW AGTRSTTCTV SLVDRNGRKL ETTGLLSPDD ESGSFSANV