Gene Franean1_4142 details

Gene Information

Locus tag: Franean1_4142
Symbol:
ID: 5672499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4924092
End bp: 4925363
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 71%
IMG OID: 641243017
Product: putative transposase
Protein accession: YP_001508434
Protein GI: 158315926
COG category:
COG ID:
TIGRFAM ID:
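
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4924092
END_BP = 4925363
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the recorded Gene Length and Protein Length.
    print(span_length(START_BP, END_BP), GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP), PROTEIN_LENGTH_AA)
```

Under those assumptions, both computed values agree with the recorded Gene Length and Protein Length.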


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.304536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTGCACGT TGCCGCGGCT GGGCTTCGTG CCGGACGACG TGACGTCGGC GCCGCGGGCG 
GCGATGGTAC GGCTGGCTGA GCGGTTAGGC GTCGACCCGG ACGTCATCGG TTCGTACGGA
CGGCGGGCCA AGACGCGCAC GGATCATCTG CGGCTGGTGG CGCAGTATCT GGGGTGGCGG
GTGCCGACGA CGCTGGACCT CAAGGAGCTG GACGAGTTTC TGTTGGCGCG GGCGATGGAA
CACGACGCGC CGACGTTGCT GTTCCGGCTG GCGTGCGAGT ACCTGATCTC GGCGAAGGTG
ATCCGGCCGG GTCCGGTGAC GGTGGTGAAG CGGGTCGCGC ACGCCCGCGA GGTCGCGCAG
CAGGAGACGT TCGACCGGCT GGCGCACGAG TTCACCGGCG AGCGTCGCGT TGGGCTGGAC
GGCCTGCTGG TGACCGACCC TGAGATCGGG CGCGCACACC CTGGATTGTC GGTGCTGCCG
GCGGAACGGC GCAGGTTCCT GGCCACGGTG GGCCGCCGGC TGACGGCGCA GGCGTTACAG
CGGCGCGAGC CGCAGCGTCG GTACCCAATC CTGCTGACCT TGTTGGCGCA GTCGGCGATC
GACGTGCTGG ACGAGGTCGT GCAGTTGTTC GACCAGGCCG TCTCGGCGCG GGAAGCCAAG
GCCGCGCACA GGATGCGCGA CGAGTTGGCC GAGCGCGGCA AGGCTGGCGA GGAGCGCCAG
GCCCTGCTGG ACACGGTCTT GGCGATCGTC GCCGACCCGG CGATCCCGGA CGAGGACGTC
GGCGGGCTGA TCCGCGGGGA GAAGGTGGGG TGGGAGCGGC TGCGTGCCGC GCAGGCCGCC
GCGTTGCCAC CGCTGCCACG CGACCACGGG CACCTCGCGT CGCTGGACGG TTCCTACGGG
TACCTGCGGC AGTTCACCCC ACAGGTACTG GACGCGGTGA CCTTCGCCGG CGGCACGGCC
ACGGCCGACC TGCTCAAAGC GGTGGACATC CTGCGTGAGT TGAACGCCAC CGGGGCCCGC
AAGGTCCCGG ATGACGCGCC GAGCGGCTTC GTCCCGGCCC GCTGGCGCGG CTACCTCGAC
ACCGCGGCGA AGGCGGACAG TGTCACCGCC TACCGCCACT ACTGGGAGCT GTGCACCTTG
CTGGCGTTAC GCGACGGGCT ACGTACCGGC GACGTGTTCG TGTCTGGCTC GCGCCGCTAC
TCCGACCCGG CCGCCCACCT GCTCACCCCT GAGAAGTGGG CCGACCAGCG GGCCGATGGA
CGCCGACGGT GA
 
Protein sequence
MCTLPRLGFV PDDVTSAPRA AMVRLAERLG VDPDVIGSYG RRAKTRTDHL RLVAQYLGWR 
VPTTLDLKEL DEFLLARAME HDAPTLLFRL ACEYLISAKV IRPGPVTVVK RVAHAREVAQ
QETFDRLAHE FTGERRVGLD GLLVTDPEIG RAHPGLSVLP AERRRFLATV GRRLTAQALQ
RREPQRRYPI LLTLLAQSAI DVLDEVVQLF DQAVSAREAK AAHRMRDELA ERGKAGEERQ
ALLDTVLAIV ADPAIPDEDV GGLIRGEKVG WERLRAAQAA ALPPLPRDHG HLASLDGSYG
YLRQFTPQVL DAVTFAGGTA TADLLKAVDI LRELNATGAR KVPDDAPSGF VPARWRGYLD
TAAKADSVTA YRHYWELCTL LALRDGLRTG DVFVSGSRRY SDPAAHLLTP EKWADQRADG
RRR
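
Both sequences are displayed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch, not the database's own tooling. The helper names are hypothetical, and `lengths_agree` assumes one codon per residue plus a single stop codon.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_agree(dna: str, protein: str) -> bool:
    """True if the DNA is exactly one codon per residue plus one stop codon."""
    return len(dna) == 3 * (len(protein) + 1)


# Usage on the first and last displayed gene-sequence lines only, not the full record.
first_line = clean_sequence(
    "TTGTGCACGT TGCCGCGGCT GGGCTTCGTG CCGGACGACG TGACGTCGGC GCCGCGGGCG "
)
last_line = clean_sequence("CGCCGACGGT GA")
print(len(first_line), len(last_line))
```

Passing the full gene and protein sequences through `clean_sequence` and then to `gc_fraction` and `lengths_agree` would allow a comparison with the GC content, Gene Length and Protein Length fields. Note that the gene sequence starts with TTG while the protein starts with M, which is consistent with TTG serving as an alternative start codon under translation table 11.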