Gene Franean1_1889 details

Gene Information

Locus tag: Franean1_1889
Symbol:
ID: 5670291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2267030
End bp: 2268307
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 72%
IMG OID: 641240811
Product: transposase IS4 family protein
Protein accession: YP_001506233
Protein GI: 158313725
COG category:
COG ID:
TIGRFAM ID:
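
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2267030, 2268307)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.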


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.847623
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG 
ACCCCGGACC AGGTGTCGGT CGGGGTGCTG GTGACGGCGG TGCCGCGTGA CGCGGTCGAC
GAGGCCGTCG CGGCCTGTGG GGTGGGTGCG CGGCGGGCGG GCGGGAAGCT CCCACCGCAT
GTGACGGCGT ACCTGACGTT GGCGATGTCC CTGTTTCCGG ACGACGACTA CGCCGAGGTC
GCCCAGAAGG TGACCGGGTC GCTGGACCGG TTCGGCTGCT GGGACGCGGC GTGGGCGCCG
CCGAGCGCGA GCGGGATCAC CCAGGCGCGT AAGCGGCTGG GCCGGATGGT GATGGCCGAG
GTGTTCGAGC GGGTCGCGGG CCAGGTCGCG ACACTGTCGA CGCGTGGCGC GTGGCTGCGG
GGCCGGTTGT TGCTCGCGAT CGACGGGTTT GACGTCGACG TGCCCGACAC CGAGGAGAAC
GCGGCCGAGT TCGGCTACGC CGGCACCGGG GAGAAGCGGT CGGCGTTCCC GAAGATCCGG
GTCGTCGCGT TGGCGGAGTG CGGGACGCAC GCGTTCCGGG CCGCCGAGGT CGGTGGCTGG
GCGGCTGGGG AGAGGACGCT GGCCCGCGGG CTGCTGATGC GGCTGAACCG CGACGAGGTG
CTGACCGCCG ACCGTGGGTT CTACTCGTTC GACAACTGGG CGCTGGCCGC GGGCACCGGC
GCCGACCTGA TCTGGCGGGC CCCGACCGGG CTGAACCTGC CGGTCGTGCG GGTCCTGTCC
GATGGCACGT TCCTCACCGT CCTGATCAAC CCGGAGATCA CGGGAGGTCG GCGCCGCGAG
CGGCTGCTCG CCGCCGCGAA GGCCGGCGAC GAGCTTGATC CGGACGAGGC GCACCTGGCC
CGGGTCGTCG AGTACGACAT CCCCGACCGG GCCGGTAACG GTACCGGCGA ACTGGTCGTC
GTGCTGACCA CGATCCTCGA CCCGCGTCAG GCCCGTGCCG ACGAGGTCGC CGCCGGATAC
AACGAGCGCT GGGAGGAGGA AACCGCGAAC GACCAGCTCA AGACCCATCT ACGCGGCCCC
GGGAGAGTCC TGCGCTCCCG GCTGCCGGAC CTGGCGGTCC AGGAGATGTG GGCCTGGCTG
ATCGTCCAGT ACGCGCTCAC CGCCCTGATC GCCGGCGCCG CGGAGGCCGC CGAGATCGAC
CCCGACCGGG TCGGTTTCGC CCGGACACTG CGCCTGGTCC GCCGTTCCGC CACCGGAACG
GCGGACATTT CCCCCTGA
 
Protein sequence
MAAWPRCRWV VWSRFVASMV TPDQVSVGVL VTAVPRDAVD EAVAACGVGA RRAGGKLPPH 
VTAYLTLAMS LFPDDDYAEV AQKVTGSLDR FGCWDAAWAP PSASGITQAR KRLGRMVMAE
VFERVAGQVA TLSTRGAWLR GRLLLAIDGF DVDVPDTEEN AAEFGYAGTG EKRSAFPKIR
VVALAECGTH AFRAAEVGGW AAGERTLARG LLMRLNRDEV LTADRGFYSF DNWALAAGTG
ADLIWRAPTG LNLPVVRVLS DGTFLTVLIN PEITGGRRRE RLLAAAKAGD ELDPDEAHLA
RVVEYDIPDR AGNGTGELVV VLTTILDPRQ ARADEVAAGY NERWEEETAN DQLKTHLRGP
GRVLRSRLPD LAVQEMWAWL IVQYALTALI AGAAEAAEID PDRVGFARTL RLVRRSATGT
ADISP
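
The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as a start codon. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical; the amino-acid assignments are the standard ones shared by table 11, and the start-codon set is the one listed for that table.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG"
print(translate(first_line))  # MAAWPRCRWVVWSRFVASMV
```

The output matches the first 20 residues of the protein sequence in the record.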