Gene Franean1_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3535 
Symbol 
ID5671905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4194350 
End bp4196026 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content65% 
IMG OID641242422 
Producttransposase IS4 family protein 
Protein accessionYP_001507842 
Protein GI158315334 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGC AGCCACGTCC GTGGCCGCAG GTTCCTGAAC AGACCGCGGC GGTGGCCTGT 
GCGGCGTTCC CGAAAGGCAC ACTGGCAATC CGTGTTCGCG ATGAGCTGCC CGAGTTGTTT
GCTGATGAGC AGTTCCTCGC AGCGTTCGGC GTGCGCGGTA GACCAGGCAT CTCACCGGGG
CAGTTGGCGC TGGTCACGGT GTTGCAGTTC GCGGAGAACC TCACCGACCG GCAGGCGGCC
GACGCGGTAC GGGCCCGGAT CGACTGGAAA TACGCCCTCG GTCTGGAGCT GACCGACGCA
GGGTTTGATC ACACTGTGTT GACCGGGTTC CGGCAGCGGC TTATCGACCA TGGTCTGGAG
GAGAAGGTAC TGGACCTGCT GCTGGCCCGG CTGTCCGAGT TAGGACTCGT CAAGGCTGGC
GGCCGGCAAC GCACCGACTC TACGCATGTG CTGGCGGCGG TACGCTCGGC CAATCGGCTG
GAGTTCCTCG CCGAGACACT GCGAGCGGCC TTGGAGGCGT TGGCCGTGGC CGCACCGGAC
TGGCTGAGGG CCCAGATCAA CACCGAATGG GTGACACGGT ACGGCGCCCG TATGGATTCC
TACCGGATTC CGAAGGGCGA CGACAAGCGT AAAGCGATGG CCATTCAGGT TGGAGTCGAC
GGGTTCGGTC TTCTGGAAGC CGTACACACC GTCGGCGCAC CGATCTGGCT ACGTGAGATC
CCCGCCGTGG TCACCCTGCG TGCGGTGTGG CTCAGGCAGT ACCACCGCAC GATCACCCAT
GACGGGCAGG AGGTGGCGTG GCGGGAGGAA AAAGACCTCC CGCCCAGCAG AGACCGGATC
TGCTCGCCGT ACGACACCGA CGCCCGGTAC GCGACCAAAC GCGGTTCCGG CTGGGAGGGC
TACAAAGTCC ATCTCACGGA GACCTGCGAC GACGTGAGCA CGACCGGCGC GCCACACCTG
GTCACGAATG TGACCACCAC CGACGCGACC GTCACCGACG TGGAGATGCT CGAACGGATC
CACAAGGATC TCGACCGCAG ATCGTTACTT CCGGCGGAGC ACCTGGTCGA CGCCGGCTAC
ACCAGCGCCG AGCTCCTCAT CGACTCCCAG CGCGATTTCA GTATCACGTT GCTCGGTCCG
CTGCCGGCCG ACAACTCCCA CCAGGTTCAG GCCCGTGGTG GCTTCGAACG CGCCGCGTTC
GCCATCGACT GGGACAACCA GCGGGTCACC TGCCCGCAAG GCGTGACCAG CACGATCTGG
TCGTCCTGCA ACGAACGCGG CCGGGAATCG ATCGTGGTTC GTTTCCCCGT CACAGCCTGC
CAGCCATGTC CCGTCCGTTC ACAATGCACA CGAGCCACCC GGAACGGCCG CCAGTTGATG
CTACGTCCCC GCGACATCCA CGAAGCGGTC GAGCAGGCCC GCGCCAAACA GAACACCGAC
GAGTGGAAAC AGCGCTACGC AACCCGCGCC GGCGTCGAGA GCACCATCCA TCAATCAGTT
GCCGTCACCG GGATCCGCCG CTGCCGCTAC ACCGGACTAC CCAAGACCCG ACTTGCCCAC
GTCCTCGCCG CCACCGCCCT CAACCTGATC CGGTTGGACG CGTGGTGGAC CGGCACGCCA
CTCGACCGGC CTCGGGCCAG CCACCTCGCA AGACTCGACT TCAGCCTCGC CGCATAG
 
Protein sequence
MSMQPRPWPQ VPEQTAAVAC AAFPKGTLAI RVRDELPELF ADEQFLAAFG VRGRPGISPG 
QLALVTVLQF AENLTDRQAA DAVRARIDWK YALGLELTDA GFDHTVLTGF RQRLIDHGLE
EKVLDLLLAR LSELGLVKAG GRQRTDSTHV LAAVRSANRL EFLAETLRAA LEALAVAAPD
WLRAQINTEW VTRYGARMDS YRIPKGDDKR KAMAIQVGVD GFGLLEAVHT VGAPIWLREI
PAVVTLRAVW LRQYHRTITH DGQEVAWREE KDLPPSRDRI CSPYDTDARY ATKRGSGWEG
YKVHLTETCD DVSTTGAPHL VTNVTTTDAT VTDVEMLERI HKDLDRRSLL PAEHLVDAGY
TSAELLIDSQ RDFSITLLGP LPADNSHQVQ ARGGFERAAF AIDWDNQRVT CPQGVTSTIW
SSCNERGRES IVVRFPVTAC QPCPVRSQCT RATRNGRQLM LRPRDIHEAV EQARAKQNTD
EWKQRYATRA GVESTIHQSV AVTGIRRCRY TGLPKTRLAH VLAATALNLI RLDAWWTGTP
LDRPRASHLA RLDFSLAA