Gene Franean1_3748 details


Gene Information

Locus tag: Franean1_3748
Symbol:
ID: 5672113
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start (bp): 4439007
End (bp): 4440284
Gene length: 1278 bp
Protein length: 425 aa
Translation table: 11
GC content: 72%
IMG OID: 641242629
Product: transposase IS4 family protein
Protein accession: YP_001508049
Protein GI: 158315541
COG category:
COG ID:
TIGRFAM ID:
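
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS that ends in one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record for Franean1_3748.
length = gene_length_bp(4439007, 4440284)
residues = expected_protein_length(length)
print(length, residues)  # 1278 425
```

Both results agree with the gene length (1278 bp) and protein length (425 aa) reported in the record.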


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.128995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG 
ACCCCGGACC AGGTGTCGGT CGGGGTGCTG GTGACGGCGG TGCCGCGTGA CGCGGTCGAC
GAGGCCGTCG CGGCCTGTGG GGTGGGTGCG CGGCGGGCGG GCGGGAAGCT CCCACCGCAT
GTGACGGCGT ACCTGACGTT GGCGATGTCC CTGTTTCCGG ACGACGACTA CGCCGAGGTC
GCCCAGAAGG TGACCGGGTC GCTGGACCGG TTCGGCTGCT GGGACGCGGC GTGGGCGCCG
CCGAGCGCGA GCGGGATCAC CCAGGCGCGT AAGCGGCTGG GCCGGATGGT GATGGCCGAG
GTGTTCGAGC GGGTCGCGGG CCAGGTCGCG ACACTGTCGA CGCGTGGCGC GTGGCTGCGG
GGCCGGTTGT TGCTCGCGAT CGACGGGTTT GACGTCGACG TGCCCGACAC CGAGGAGAAC
GCGGCCGAGT TCGGCTACGC CGGCACCGGG GAGAAGCGGT CGGCGTTCCC GAAGATCCGG
GTCGTCGCGT TGGCGGAGTG CGGGACGCAC GCGTTCCGGG CCGCCGAGGT CGGTGGCTGG
GCGGCTGGGG AGAGGACGCT GGCCCGCGGG CTGCTGATGC GGCTGAACCG CGACGAGGTG
CTGACCGCCG ACCGTGGGTT CTACTCGTTC GACAACTGGG CGCTGGCCGC GGGCACCGGC
GCCGACCTGA TCTGGCGGGC CCCGACCGGG CTGAACCTGC CGGTCGTGCG GGTCCTGTCC
GATGGCACGT TCCTCACCGT CCTGATCAAC CCGGAGATCA CGGGAGGTCG GCGCCGCGAG
CGGCTGCTCG CCGCCGCGAA GGCCGGCGAC GAGCTTGATC CGGACGAGGC GCACCTGGCC
CGGGTCGTCG AGTACGACAT CCCCGACCGG GCCGGTAACG GTACCGGCGA ACTGGTCGTC
GTGCTGACCA CGATCCTCGA CCCGCGTCAG GCCCGTGCCG ACGAGGTCGC CGCCGGATAC
AACGAGCGCT GGGAGGAGGA AACCGCGAAC GACCAGCTCA AGACCCATCT ACGCGGCCCC
GGGAGAGTCC TGCGCTCCCG GCTGCCGGAC CTGGCGGTCC AGGAGATGTG GGCCTGGCTG
ATCGTCCAGT ACGCGCTCAC CGCCCTGATC GCCGGCGCCG CGGAGGCCGC CGAGATCGAC
CCCGACCGGG TCGGTTTCGC CCGGACACTG CGCCTGGTCC GCCGTTCCGC CACCGGAACG
GCGGACATTT CCCCCTGA
 
Protein sequence
MAAWPRCRWV VWSRFVASMV TPDQVSVGVL VTAVPRDAVD EAVAACGVGA RRAGGKLPPH 
VTAYLTLAMS LFPDDDYAEV AQKVTGSLDR FGCWDAAWAP PSASGITQAR KRLGRMVMAE
VFERVAGQVA TLSTRGAWLR GRLLLAIDGF DVDVPDTEEN AAEFGYAGTG EKRSAFPKIR
VVALAECGTH AFRAAEVGGW AAGERTLARG LLMRLNRDEV LTADRGFYSF DNWALAAGTG
ADLIWRAPTG LNLPVVRVLS DGTFLTVLIN PEITGGRRRE RLLAAAKAGD ELDPDEAHLA
RVVEYDIPDR AGNGTGELVV VLTTILDPRQ ARADEVAAGY NERWEEETAN DQLKTHLRGP
GRVLRSRLPD LAVQEMWAWL IVQYALTALI AGAAEAAEID PDRVGFARTL RLVRRSATGT
ADISP
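
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not part of the original record. It assumes NCBI translation table 11 (the table named in the record), which uses the standard amino acid assignments and accepts several alternative start codons, including GTG; an alternative start codon at the first position is translated as methionine. The function name `translate_cds` and its `is_cds_start` parameter are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"                    # initiator codon reads as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
gene_first_line = "GTGGCGGCGT GGCCTCGGTG CCGCTGGGTG GTTTGGTCAC GGTTCGTGGC TTCGATGGTG"
gene_last_line = "GCGGACATTT CCCCCTGA"

print(translate_cds(gene_first_line))                      # MAAWPRCRWVVWSRFVASMV
print(translate_cds(gene_last_line, is_cds_start=False))   # ADISP
```

The two outputs match the first 20 residues and the final 5 residues of the protein sequence listed above. The last line can be translated on its own because the preceding 21 lines contain 60 bases each, so it begins on a codon boundary.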