Gene Franean1_5390 details


Gene Information

Locus tag: Franean1_5390
Symbol:
ID: 5673722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6499663
End bp: 6501276
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 71%
IMG OID: 641244246
Product: transposase
Protein accession: YP_001509652
Protein GI: 158317144
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
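
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the record does not state either convention, so both are assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 6499663
end_bp = 6501276

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1614
print(protein_length)  # 537
```

Under those assumptions, both values agree with the Gene Length (1614 bp) and Protein Length (537 aa) fields.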


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGTCG CGATGGACTA CGGGCCGCCG AGCGTCGAGA AGCCGCTCGG CGCGCTGCCC 
GTCGTGCGTG ACTACCTGGC CCGTCTCGAT GTCGCGGGCA CGATCGATCG ACTCGCCCCG
ATGCGTGACA AGGTCAACCG GGCCACCCAC GGTGACGTGA TCGCCGCGCT GGTCGCGAAC
CGGCTGACCT CGCCGACCCC GCTGCTGCAC GTCGAGCGGT GGGCACGCCA GTGGGCGGTC
GAGGAGATGT TCGGTATCGC GCCGGACGTG TTGAACGACG ACCGGGTCGG TCGGGCGCTT
GACGCGCTCG CCCCGGTCTG TGAGGCCGTG GTCGGTTCGG TCGGTGCCGC GGCGATCGCC
GCGTTCGGCC TCGACGTCTC CCGTATCCAC TGGGACATGA CGTCGATCTC GCTGCACGGC
GCCTACCCCG AGGTCGACGA GGACTACACG ACCCCGAGAT ACGGCCATCC CAAGGACCGC
CGGCCCGACC TCAAACAGGT CCAGACCGGG CTCGCGGTCA CCGGTGACGG CGGGATCCCG
CTGGTCCACC GCGCCTACGA CGGCGGCGCC GGCGAGGTCG CCCAGGTCAC CGGCGCGATG
CGCGCCCTGG CCACGATCGC CGGCGAACGG CGGTTCCTGC TGGTCGGAGA CAGCAAGCTC
GTTTCCTACG GAAACCTGAC CGCGCTGATC GACGCCGGGG TGGAGTTCCT CGCTCCGGCA
CCGAAACCCG TGGTGCCCGC TTCGACGCTC GCCGGGCTGG ACTGGGCCAG CGCTGGCATC
GTCGAGTACG TCGCCGACCG TGACCAGGCC AGGTTCGCTC ACCAGCGGGC CTCCTACCGG
GCCCGCGAAG GGGTGACAAC GCTGCGCGGG CCACGCAAGA AGGACCCGGT CCTCACGGTG
CGCACGGTCT ACATCTGGTC GTCGGCCCGC GCGCACGCCG CGCGGGCTGC CCGGTCGAGC
AAACTCGACC GGGCCCGCGA CGACCTCGAC CGGCTCTCCC ACGCCGCCGG TTCCCACCAC
CTCTACCGCG ACGTTGCCGC GGTGACCGCC CGCGTCGCGA CGATCGCGGC GAAACGCCGC
GTCGCCGGCT ACCTCCTCAC CGAGACGAGC ACCGACCCGA ACACCGGGAA ACCTGTCCTG
ACCTGGCATT TCGACCAGGC CGCGCTCGAC GCCGAGGCCG CCACCGACGG CTGGTACGCC
CTGCTGACCA ACCTGCCTGA CGACGTCGGC CCGGCCGAGG TCCTCGCCCG TTACAAGGGC
CAGGAGGTCG TCGAACGGCG CTACGGCGCG TTCAAGGGCC CGCTCGCGGT CGCCCCGATG
TTCCTGCACT CCAACCAGCG CATCCACGCG CTCATCCACG TCATCTGCCT GGCGCTGCTC
GTCTTCTCGC TCATCGAGCG CCAGGCACGG CTCGGTGCTG GCCCGGACGG GAAGATCCCC
GGCCTCTACG CCGGCCGGCC CGCCCGGCCC ACCGGCGCGC TCGTCCTCGG CGCACTCAAC
ACACTGCGCC TCATGCCCGC GCGTGACGGC CAGCCCGCTT ACATCCCCCG CCCACCGCGC
CTTCACCAAC ACCTCCTCGA CATCCTCGGC ACCGACCCGA CCCGACCACC TTGA
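
A GC content percentage like the one reported for this gene can be recomputed from a nucleotide sequence. The sketch below is a minimal illustration, not the database's own method; the function name `gc_content` is hypothetical, and it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Keep only nucleotide letters, so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence above: 5 G + 2 C out of 10.
first_block = "GTGTCGGTCG"
print(gc_content(first_block))  # 70.0
```

Passing the full gene sequence instead of `first_block` gives the value for the whole gene.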
 
Protein sequence
MSVAMDYGPP SVEKPLGALP VVRDYLARLD VAGTIDRLAP MRDKVNRATH GDVIAALVAN 
RLTSPTPLLH VERWARQWAV EEMFGIAPDV LNDDRVGRAL DALAPVCEAV VGSVGAAAIA
AFGLDVSRIH WDMTSISLHG AYPEVDEDYT TPRYGHPKDR RPDLKQVQTG LAVTGDGGIP
LVHRAYDGGA GEVAQVTGAM RALATIAGER RFLLVGDSKL VSYGNLTALI DAGVEFLAPA
PKPVVPASTL AGLDWASAGI VEYVADRDQA RFAHQRASYR AREGVTTLRG PRKKDPVLTV
RTVYIWSSAR AHAARAARSS KLDRARDDLD RLSHAAGSHH LYRDVAAVTA RVATIAAKRR
VAGYLLTETS TDPNTGKPVL TWHFDQAALD AEAATDGWYA LLTNLPDDVG PAEVLARYKG
QEVVERRYGA FKGPLAVAPM FLHSNQRIHA LIHVICLALL VFSLIERQAR LGAGPDGKIP
GLYAGRPARP TGALVLGALN TLRLMPARDG QPAYIPRPPR LHQHLLDILG TDPTRPP
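
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table; `translate` and the constant names are hypothetical, and a library such as Biopython would normally be used instead.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(cds.split()).upper()  # drop spaces and line breaks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTCGGTCG CGATGGACTA CGGGCCGCCG"))  # MSVAMDYGPP
```

The result matches the first ten residues of the protein sequence above.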