Gene Franean1_3143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3143 
Symbol 
ID5671520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3698724 
End bp3699980 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content70% 
IMG OID641242038 
Producttransposase IS4 family protein 
Protein accessionYP_001507458 
Protein GI158314950 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGCCG CGTGCACGTA CTACATTTCC AGATCTTGGA ACTTTGTCGG GTGGCTGATC 
GTCGGGCCTG GCATGGTTGG GGTCGTGGCA GCGGAGGATG TCGTCGGGTG GGAGCGGGAG
CTCGCGGCGT TGACGGACGG GCTGGGTGGG TTGTTCAACC GGCCTGAGCC CAGGCGTGTG
TTCGGTGACT TCGTGCGGGC GCTGCTGGCG GACGTACCGA AGAAGAACTC GTGGGGGCTG
GCCGAGCATG CGGGTTATGC AACGCCGCGG CCGTTCGAGC ATCTGCTCGA CGGGGCTGTG
TGGGACGCCG ATCTGCTGCG CGACGCGGTG CGGGAGTTCG TGGTCGACCG GCTCGGGTCG
CCGGTGGGTG TGCTGGTCGT CGATGACACG CAGGCGTTGA AGAAAGGTGA CAAGTCGGTG
GGGGTGGCTC CTCAGTACTA CGGGCTGACC GGGGACGTCG CGAACGTGCA GACCATGGTC
ATGTGTACCT ATGCCTCGCC GGCCGGGCAC GCGTTCGTGG ACCGGGAGTT GTACCTGCCC
GAGGTGTGGA CCAGCGACCC GGCCCGCTGC CGGGCGGCCG GCGTGCCCAC CGACCGACAG
TTCGCCACGA AACCCCAGCT CGCGGTGGCG ATGCTGACCC GGGCGGTCGA CGCCGGGGTG
CCGTTTCGCT GGGTCGTCGC CGACAGCGGC TACGGCAAGG ACGCCCGGCT GCGGGGGTTC
TGCCACGACC GGGGGCTGTC CTACGTGCTG GCCGTCCCGA AGAACCTCGC CCTCCTCGAC
GCCCGGGGCC GGCCGACCCG CCCGGACCGG TTACACGCCC GGCTGCCCGT GGGAGTGTTC
GAGCGCCGTT CGTGCGGTGC CGGGTCGAAA GGCGCCCGCT GGTATGACTG GGCCGCCCAC
GCGGTCACCG TCGCCGGAGA GGACCCGGCC AGCGGGCACG CTCACACCCT GCTGGTGCGT
AAGTCCACCA CCCCGCGTAC TCGTGACGGC AAGACCTTCT ACGACGTCGA GTACTTCCTC
GCCCACGCCC CGACCGCGAC CGGCGTCCCC GACCTGGTCG CCGCCGCCGG GACGAGGTGG
ACCATCGAGG AAAACAACGG CCAGGGCAAG GACGTCCTCG GTCTCGACCA GTACCAGGTC
CGGAAATGGA CCCCCTGGCA CCGACACGTC ACCCTCAGCA TGCTCGCCCA GGCGTTCCTC
GCCGCGACCC GCGCCAACCC GGGAAAAGAC CCCCGCATCC AGGAGGCCAC CAGCTAA
 
Protein sequence
MAAACTYYIS RSWNFVGWLI VGPGMVGVVA AEDVVGWERE LAALTDGLGG LFNRPEPRRV 
FGDFVRALLA DVPKKNSWGL AEHAGYATPR PFEHLLDGAV WDADLLRDAV REFVVDRLGS
PVGVLVVDDT QALKKGDKSV GVAPQYYGLT GDVANVQTMV MCTYASPAGH AFVDRELYLP
EVWTSDPARC RAAGVPTDRQ FATKPQLAVA MLTRAVDAGV PFRWVVADSG YGKDARLRGF
CHDRGLSYVL AVPKNLALLD ARGRPTRPDR LHARLPVGVF ERRSCGAGSK GARWYDWAAH
AVTVAGEDPA SGHAHTLLVR KSTTPRTRDG KTFYDVEYFL AHAPTATGVP DLVAAAGTRW
TIEENNGQGK DVLGLDQYQV RKWTPWHRHV TLSMLAQAFL AATRANPGKD PRIQEATS