Gene Franean1_3039 details

Gene Information

Locus tag: Franean1_3039
Symbol:
ID: 5671418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3572412
End bp: 3573518
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 69%
IMG OID: 641241937
Product: integrase catalytic region
Protein accession: YP_001507357
Protein GI: 158314849
COG category:
COG ID:
TIGRFAM ID:
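
The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3572412
END_BP = 3573518
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))   # 1107
print(coding_codons(GENE_LENGTH_BP))   # 368
```

Under those assumptions the reported gene length (1107 bp) and protein length (368 aa) agree with the coordinates.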


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTGCCG GACAGGGCCC GCGTCGGGCA GGCTGGATGG TCATGGTGTG GTCCCTGTTC 
TACGCCCTGA CACGCAACGC TCTCGGAGTG ATGCTGCTCC GAGTCCGCGG GGACACCGCG
AAGGACGTGG AGCTCCTCGT CCTGCGACAT CAGGTGGCGG TGTTACGACG GCAGGTGAAC
CGCCCGGCGC TGGAACCGGC GGATCGGGTG ATCCTCGCAG CCCTGTCCCG GCTGTTGCCC
CGGGCCCGCT GGGGTTCGTT CGTCGTCACC CCGGCCACCG TATTGCGCTG GCACCGTGAC
CTCCTCGCAC GACAATGGAC CTACCCTCGG ACGTCGCCCG GACGGCCATC GGTCCGCCGG
GAGATCCGCG AGCTGGTCCT GCGCCTCGCA CGGGAGAACC CGACCTGGGG CCACCGCCGG
ATCCAAGGAG AACTCGTCGG GTTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC
CTGCACCGCG CCGGTGTCGA TCCAGCGCCC CGTCGGGCCG ACGCCTCCTG GCGCACGTTC
CTGCGCGCCC AGGCCTCCGG CATCCTCGCC TGCGATTTCT TCACCGTGGA CACCGTATTC
CTACAACGGA TCTACGTGTT CTTCGTCGTC GAGCACGCCA CCCGCCGTGT CCACGTCCTC
GGGGTCACGA AGCATCCAAC CGCGGCCTGG GTCACCCAGC AGGCACGGAA CCTGCTGATG
GACCTCGAGG AACGTGGCCA CCGGTTCCGG TTCCTCCTCC GTGACCGCGA CACGAAATTT
ACGGTTTCCT TCGACGCTGT CTTTGCCGGA GCCGGTATCG ACGTGGTGCG CACACCGCCA
CAGTCGCCGC AGGCGAACGC GACCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGC
ACCGACAGGC TGTTGATCGT CTCCGAACGG CACCTGACCA CCGCCCTCAC CACATACGCC
GAGCATTTCA ACACCCACCG GCCTCACCGC TCCCTCGGCC AGCACCCGCC CGACCCGCCA
CCCGTGGTCA CCCCGACCCC GGGTTCCACC GTCCGTCGCA CACGCATCCT CGGCGGGCTG
ATCAACGAGT ACCGCAACGC CGCCTGA
 
Protein sequence
MPAGQGPRRA GWMVMVWSLF YALTRNALGV MLLRVRGDTA KDVELLVLRH QVAVLRRQVN 
RPALEPADRV ILAALSRLLP RARWGSFVVT PATVLRWHRD LLARQWTYPR TSPGRPSVRR
EIRELVLRLA RENPTWGHRR IQGELVGLGY PVGVATVWRI LHRAGVDPAP RRADASWRTF
LRAQASGILA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTAAW VTQQARNLLM
DLEERGHRFR FLLRDRDTKF TVSFDAVFAG AGIDVVRTPP QSPQANATAE RWVGTARREC
TDRLLIVSER HLTTALTTYA EHFNTHRPHR SLGQHPPDPP PVVTPTPGST VRRTRILGGL
INEYRNAA
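
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is among the alternative initiation codons and the initiator is read as methionine. The sketch below shows how the two sequence blocks relate. It is a minimal illustration, not the database's own pipeline: the function names are hypothetical, and it simply assumes the first codon is a valid start and reports it as M. The codon table uses the standard amino-acid assignments, which table 11 shares.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a valid start and is reported as M.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
first_bases = "TTGCCTGCCG GACAGGGCCC GCGTCGGGCA"
print(translate_cds(first_bases))  # MPAGQGPRRA
```

Pasting the full gene sequence block into `translate_cds` and `gc_percent` is one way to compare against the listed protein sequence and GC content.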