Gene Franean1_0441 details

Gene Information

Locus tag: Franean1_0441
Symbol:
ID: 5675655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 522170
End bp: 523276
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 67%
IMG OID: 641239374
Product: integrase catalytic region
Protein accession: YP_001504812
Protein GI: 158312304
COG category:
COG ID:
TIGRFAM ID:
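
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(522170, 523276)
print(length)                     # 1107
print(protein_length_aa(length))  # 368
```

Both results agree with the Gene Length and Protein Length fields of this record.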


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid HitchhikerNo
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTGCCG GACAGGGCCC ACATCGGGCA GGCTGGATGG TCATGGTGTG GTCGTTGCTC 
TACGCCCTGA CACGCAACGC TCTCGGGCTG ATGCTGCTCC GGGTCCGTGG GGACACCGCG
AAGGACGTCG AGCTCCTCGT CCTGCGGCAT CAGGTGGCGG TGTTGCGACG GCAGGTGAAC
CGCCCGGCCC TGGAACCGGC AGATCGGGTG ATCCTCGCAG CCCTGTCCCG GCTGCTACCC
CGGGCCGGCT GGGGTTCGTT CTTCGTCACC CCGGCCACCG TGCTGCGCTG GCACCGTGAG
CTCCTCGCGC GAAAATGGAC CTATCCGCGC AAGACCCCTG GGCGGCCGCC GGTCCGCCGG
GAGATCCGTG AGCTGGTTCT GCGTCTCGCG CGGGAGAATC CGACCTGGGG CCACCGCAGG
ATCCAGGGAG AACTGATCGG GCTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC
CTGCACCGCG CTGGTGTCGA CCCCGCGCCG CGGCGGGCTG ACGCCTCTTG GCGTACGTTC
CTGTCCGCGC AGGCCTCCGG CCTGCTGGCC TGCGATTTCT TCATGGTGGA CACTGTGTTC
CTGCAGCGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CGCGCCGTGT TCATGTTCTC
GGGGTCACGA AGCATCCGAC CTCGGCGTGG GTCACCCAGC GTGCGCGGAA CCTGCTGATG
GATCTCGACG AGCGTTGCCA CCGGTTCCGG TTCCTGATCC GTGACCGCGA CATGAAGTTC
ACGGCTTCCT TCGACGCTGT CTTCATCGGG GCCGGTATCG ACGTGGTACG CACACCCCCG
CAAGCTCCGA AGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGCGAATGC
ACCGACAGAC TGCTGATCGT CTCCGAACGA CACCTGACGT CAGTCCTCAC CACCTACGCC
GAGCACTTCA ACACCCACCG GCCTCACCGC TCCCTCGGCC AGCACCCACC CGACTCGCCA
CCCGTGGTCG CCCCGACGTT GGAGTCCACC GTCCGTCGCA CACGCATCCT CGGCGGCATG
ATCAACGAAT ATCGCAACGC CGCCTGA
 
Protein sequence
MPAGQGPHRA GWMVMVWSLL YALTRNALGL MLLRVRGDTA KDVELLVLRH QVAVLRRQVN 
RPALEPADRV ILAALSRLLP RAGWGSFFVT PATVLRWHRE LLARKWTYPR KTPGRPPVRR
EIRELVLRLA RENPTWGHRR IQGELIGLGY PVGVATVWRI LHRAGVDPAP RRADASWRTF
LSAQASGLLA CDFFMVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTSAW VTQRARNLLM
DLDERCHRFR FLIRDRDMKF TASFDAVFIG AGIDVVRTPP QAPKANAIAE RWVGTARREC
TDRLLIVSER HLTSVLTTYA EHFNTHRPHR SLGQHPPDSP PVVAPTLEST VRRTRILGGM
INEYRNAA
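
The gene sequence starts with TTG while the protein sequence starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that translation rule together with a GC-fraction helper. The helper names are hypothetical, the codon table is built from the standard NCBI amino-acid string, and whitespace in the input is assumed to be formatting only, as in the blocks above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-residue mapping as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11; an initiating start codon becomes M."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError for ambiguous bases such as N
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First five codons of the gene sequence above, followed by its stop codon.
print(translate_cds("TTGCCTGCCG GACAG TGA"))  # MPAGQ
```

Applied to the full gene sequence, translate_cds can be compared against the protein sequence listed above, and gc_fraction against the GC content field.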