Gene Franean1_3272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3272 
Symbol 
ID5671646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3876749 
End bp3877921 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content69% 
IMG OID641242164 
Producttransposase IS4 family protein 
Protein accessionYP_001507584 
Protein GI158315076 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAACAGT CCCGATCTGT CGTGGTGGTG GAGGCGTGGG CGGCTGAGCT GGAGGGGTTG 
TGTTCGCTGG TCGGGGGCCG GTTCGGGAGG GCGGAGCCGC GGCGGCGGGT GGCCGAGTAT
GTGTCGGGCC TGGTCGCGGG TTTGGATCGC AAGAACGGCT GGACGTTGGC GGAGCGCGCC
GGGGAGGTGA GCCCGGACGG GATGCAGCGT CTGCTGCGCC GCGCTGACTG GGACGTCGAC
GGCGTCCGCG ACGACATCCG CGATCATGTG GTGGGCCGGC TCGGTGACCC GGACGCCGTG
CTGATCGTCG ATGACACCGG GTTCCTGAAG AAGGGCACCC GGTCGGCCGG GGTGCAAAGG
CAGTACTCCG GGACGGCGGG GCGTACGGAG AACTGCCAGG TCGGGACGTT TCTGGCCTAT
CGGTCCCGGT TCGGGCAGGC GCTGATCGAT CGGGAGTTGT ATCTGCCCGA GGGGTGGATC
GCTGATCGGG AGCGCTGCCG CCGGGCGGGA ATCGACGACG AGGTGGCGTT CGCAACGAAG
CCTCGCCAGG CCCTCGCCAT GATCGAGCGG ACGGTCGCGT CAGGGGTGCC GTTCGGCTGG
GTGACTGCCG ACGAGGCCTA CGGACAGGTG AAATACCTGC GAGTCTGGCT CGAACAACAC
GACGTGGCGC ACGTGCTGGC GACCCGGCGC AACGACGACC TGATCACGAC CACGATGGGC
CAGGCCAGAG CCGACGAGCT GATCGCCGGA CTCTCGCCGC GGGCCTGGTG CCGGATCTCG
GCCGGCACCG GTTCCCACGG GCTGCGGGAC TACGACTGGG CGCGGGTACC GATCCGCATC
CGGACCTGGT GGACACCAGG CCGCGGCCAC TGGCTGCTCG CCCGCCGCAG CCGGACGTCC
GGCGAACTGG CCTACTACAT CTGCTACGGC CCCCGCCGCA CCTCGCTGGC CCAGCTCGCG
ACCGTCGCCG GTGCTCGCTG GGCCATCGAA GAGGCCTTCC AACAGGCCAA GCAGACCTGC
GGGCTGGACG ACTACCAGGT CCGCGACTAT CGAGCCTGGT ACGCCCACAT CACCTTGTCG
ATGCTCGCCT ACGCAGCCCT TGCCACGGTC CGCGCCGAAC AGGTCAAAGC CAGCCAGGTA
AAAGGGGCCG AAGCCCAGCC CACCAGGGCA TGA
 
Protein sequence
MKQSRSVVVV EAWAAELEGL CSLVGGRFGR AEPRRRVAEY VSGLVAGLDR KNGWTLAERA 
GEVSPDGMQR LLRRADWDVD GVRDDIRDHV VGRLGDPDAV LIVDDTGFLK KGTRSAGVQR
QYSGTAGRTE NCQVGTFLAY RSRFGQALID RELYLPEGWI ADRERCRRAG IDDEVAFATK
PRQALAMIER TVASGVPFGW VTADEAYGQV KYLRVWLEQH DVAHVLATRR NDDLITTTMG
QARADELIAG LSPRAWCRIS AGTGSHGLRD YDWARVPIRI RTWWTPGRGH WLLARRSRTS
GELAYYICYG PRRTSLAQLA TVAGARWAIE EAFQQAKQTC GLDDYQVRDY RAWYAHITLS
MLAYAALATV RAEQVKASQV KGAEAQPTRA