Gene Franean1_0125 details


Gene Information

Locus tag: Franean1_0125
Symbol:
ID: 5668550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 149587
End bp: 150732
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 73%
IMG OID: 641239053
Product: rhomboid family protein
Protein accession: YP_001504498
Protein GI: 158311990
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
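
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the protein length excludes the stop codon. Both conventions are assumptions here; the record does not state them. A minimal Python check under those assumptions (the function names are illustrative only):

```python
# Values copied from the Gene Information fields of this record.
START_BP = 149587
END_BP = 150732
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1146 381
```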


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGACGGACT CGCCAGCGGG TGATCCGGCC CGTGCGCCGG AAGGCTCTTC TGGTGCGCCC 
GGGGGTTCCG GTGGGGCCGG TAGTGCCGGC GAAGCCGAGA TCCCGCGTCC CGCGTCCCCC
CTGCCCGCCG CAGGCGAGCC GCCGCCCGCC GGCGGCCCTC CCAACGGCGG CCCTCCCAAC
GGCGGCCCTG CGTCGGGCGG CCCGCCGCAC GAGCAGGTCG GATGGCGGCC GGAGACCGGC
CCGCCCGCCG GCCACGCCGG GTGGACCCCA CCGCCGGCGG GAGCGCCGAG CCTGCCCCAC
TGCTATCGGC ATCCCGAGCG GGAGACGTAC GTCACCTGCC AGCGGTGCGG GCGTCCCATC
TGCCCGGACT GCATGCGCCC GGCGGCTGTG GGCTTCCACT GCCCGGAGGA GTCCGGCGCC
GGCGGTGGCG GGCGCCCCGA GCGGCGCCGA GAGCCGCGGA CGGACTTCGG TGGCCGGCCG
GGAGCCGGTC GCCGCGGGCT GGTCACCCAG GTTCTGATCA GCCTGTGCCT CGTCGCGTTC
GTCCTGCAGG GCCTGCCCGG GCTGGCGCGC GACTCCGGCT CCCTGAACCA GTTCAGCGCC
GACTTCCGCC TGTACGGCGT GTCTCTCGCG TGGGACGACC AGTACTACCG GCTCCTCACC
GCCGCCTTCC TGCACGTCAA CTACCTGCAC GTCCTGGTGA ACCTGTACGC GTTGTTCGTG
CTCGGCTACC AGCTCGAGGC GATTCTCGGG CGGCTTCGCC TGGTAGCCCT GTTCGTCGCC
TGCGCCGTCG GTGGGAACAC CCTGAGCTAC CTGGTGAACG GTGTGTCCGT GAACTCGGTC
GGGGCGTCCA CCGCGATCTT CGGTTTTTTC GGCGCGTACT ACGTGATCGC CCGGCGGCTG
CGCGCCGACA CGACGCAGAT CCTGATCCTG ATCGGGATCA ACTTCGCGCT CACGTTCACG
CTGTCCTTCA TCGACCGCTG GGGCCACGTC GGGGGGCTGG TGGCCGGGGT GCTCGTCGGC
CTGCTCTACG CCTACGTCCC GCCGCGCCGA ACGGTCGTGC AGGCGGCCGG GGTGCTGGCG
CTTGTCGGCC TGCTCTTCGC GGCGGCCGTC ATCAAGAGCG CGGACCTGAC CACCGCCTTC
GCCTAG
 
Protein sequence
MTDSPAGDPA RAPEGSSGAP GGSGGAGSAG EAEIPRPASP LPAAGEPPPA GGPPNGGPPN 
GGPASGGPPH EQVGWRPETG PPAGHAGWTP PPAGAPSLPH CYRHPERETY VTCQRCGRPI
CPDCMRPAAV GFHCPEESGA GGGGRPERRR EPRTDFGGRP GAGRRGLVTQ VLISLCLVAF
VLQGLPGLAR DSGSLNQFSA DFRLYGVSLA WDDQYYRLLT AAFLHVNYLH VLVNLYALFV
LGYQLEAILG RLRLVALFVA CAVGGNTLSY LVNGVSVNSV GASTAIFGFF GAYYVIARRL
RADTTQILIL IGINFALTFT LSFIDRWGHV GGLVAGVLVG LLYAYVPPRR TVVQAAGVLA
LVGLLFAAAV IKSADLTTAF A
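
The protein sequence begins with M although the gene sequence begins with GTG, which encodes valine at internal positions. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. The helper names `clean`, `translate_cds` and `gc_percent` are hypothetical, and the start codon set is deliberately limited to the three most common bacterial starts, not the full table 11 list.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares its amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification (assumption): only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and ending at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_30_bases = "GTGACGGACT CGCCAGCGGG TGATCCGGCC"
print(translate_cds(first_30_bases))  # MTDSPAGDPA
```

Passing the full gene sequence to `translate_cds` and `gc_percent` should allow a comparison with the protein sequence and the GC content field of this record.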