Gene Franean1_4655 details

Gene Information

Locus tag: Franean1_4655
Symbol:
ID: 5672998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5554002
End bp: 5555609
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 72%
IMG OID: 641243513
Product: putative PAS/PAC sensor protein
Protein accession: YP_001508929
Protein GI: 158316421
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
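
The length fields above follow from the coordinates. The short Python sketch below reproduces that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 5554002
end_bp = 5555609

# Inclusive interval length: both end positions count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```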


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCTG CTTCGAGGGG GGCCACGCGG CGCCAGGGTC CGTGCCGGAC CGGGGCGGGC 
CACGGTGGGG ATACGGCCGG CTGTTGCGAT GCCGGGGATC TGCTGGACTC GTTGGGCGAG
GTGGTGTTCC GGACCGACGC GGAGGGTTGT TGGACGTATC TGAACCGGGC GTGGACGACC
CTGACCGGCT TCGATGTCGT GGCCAGCCTG GGTGCCCAGT TCCTGGACTA TGTTCACCCG
GACGAGGTGG AGGCCACGGT CGCGTTGTTC ATGGGTGTCG TCGCGGGGGG TGCCGATCAC
TGTCACCACG AGACCCGCTA CCGCACCCGG GACGGCAGCT ACCGGCGGGT CCAGATCAGG
GCGACCGTCC TGCGGGACGC GGCGGGCGTG GTGGTCGGCA ACACCGGGAC GATCCTGGAC
GTCACCGCCG CCCGCCGTGA CGCGCAGACC GTCGGGGAAC ACGCCGCGCT GCTGGAGTTG
GTGTCCGCAG GGGTACCCGC CGGCGAGCTG CCGGTTGGGG TGGCGGTGTA TGACGCCGAC
CTGCGGCTGT GGCGTGGTTC ACCGGTCCTG GACCAGATCG CTCGTGTGCC CCAGCGTGTG
GGGGATCGTC TTGCCCAGCT GGCCGAACGG CTCTCGCCGG CGGCCGGGCC GCGGCAGCGC
TCGCTGGGTG GCGAGTGGGG CATGATCGCG GCTGCGCTGC GTACCCACCA CGCCCAGGTC
GGTGATCTCG ACCTCGTGGG GGAGCGCGGT ACCGGCCGGT CGATGCGGGT CACAGTGATC
CCATACGTAC GGGGCGGTGA GGAGATGCTC GCGCTCGTCT TCACCGACAT CACCGACCTG
CGCCGCGCCG AGCGCGAGCT GCGCCGGCTC TACGAGCAGG CGGAACGGAG CCGGGCCTGG
CTGGCGGCGA GCACCGAGAT CACGACCAGC GTGCTGGCCG CCCCGGATCC ATGCGAGGCT
CTCGGTCGCG TCGCGCGGAT GGGCCGGCGG ATGGCCGCCG CGGACCTGTC CCTGGTCGCC
GTGCCCGACG AGCACGGCAC CCTGGTCGTG ACCGCCGCCT CCGTAGCCCC CGACCTTCCC
GCCGACGCCG ACGCCCTACC AGGGCTGACC GTCACTCTCC GCGCCGCGCC AGGAGCCGGC
GTGCTCAACG CAGGCCACGC CGTTGTCCTG GACGGCCTAC AGCTGCGGGG GAACCGGGTC
GTCCACGCGC CGACGCTGCC CCTCGCCGCG GCGCTGGCCG TGCCGCTACG AATCGCCGGG
CATCAGGCGG CGATCCTTCT CCTCGCCCAC CGGAGCAGCA CGGCTCGCCT TCCGGCGCGC
GACATCGAAC TGGTCGAGGG CTTCGCCGCC GACGCCGCGC TCGCCCTCGA ACTAGCCCAG
GCACACCAGG AACGTGCCCG CCTGGCGGTC TTCGAGGACC GCGACCGCAT CGCCCGCGAC
CTCCATGACC TGGTCCTCCA ACGGCTGTTC GCCATCGGCC TGCACCTGCA GTCCCTAGCC
CGCGCCGCCG GCGACATGAT CGGCGCACGA CTCACCGCGG CCATCGACGC CCTCGACCAG
ACCATCGAGG AAATCCGCCG CACAATCCTG CAGGTACAAC CCTCATGA
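
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be joined before any analysis. The Python sketch below shows one way to do that and to compute the GC fraction, which can then be compared with the GC content reported in the record. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first printed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, copied as displayed.
first_line = "ATGACGTCTG CTTCGAGGGG GGCCACGCGG CGCCAGGGTC CGTGCCGGAC CGGGGCGGGC"
first_60 = clean_sequence(first_line)

print(len(first_60))                    # number of bases after joining
print(f"{gc_content(first_60):.1%}")    # GC fraction of this line only
```

Passing the full multi-line sequence to `clean_sequence` works the same way, because `str.split()` with no arguments splits on any whitespace, including newlines.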
 
Protein sequence
MTSASRGATR RQGPCRTGAG HGGDTAGCCD AGDLLDSLGE VVFRTDAEGC WTYLNRAWTT 
LTGFDVVASL GAQFLDYVHP DEVEATVALF MGVVAGGADH CHHETRYRTR DGSYRRVQIR
ATVLRDAAGV VVGNTGTILD VTAARRDAQT VGEHAALLEL VSAGVPAGEL PVGVAVYDAD
LRLWRGSPVL DQIARVPQRV GDRLAQLAER LSPAAGPRQR SLGGEWGMIA AALRTHHAQV
GDLDLVGERG TGRSMRVTVI PYVRGGEEML ALVFTDITDL RRAERELRRL YEQAERSRAW
LAASTEITTS VLAAPDPCEA LGRVARMGRR MAAADLSLVA VPDEHGTLVV TAASVAPDLP
ADADALPGLT VTLRAAPGAG VLNAGHAVVL DGLQLRGNRV VHAPTLPLAA ALAVPLRIAG
HQAAILLLAH RSSTARLPAR DIELVEGFAA DAALALELAQ AHQERARLAV FEDRDRIARD
LHDLVLQRLF AIGLHLQSLA RAAGDMIGAR LTAAIDALDQ TIEEIRRTIL QVQPS
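
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below is a minimal illustration, not the database's own method. It builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The name `translate` is hypothetical, and the sample input is the first 30 bases of the gene sequence.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position, paired with the
# standard amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as displayed.
print(translate("ATGACGTCTG CTTCGAGGGG GGCCACGCGG"))
```

The result can be compared with the first 10 residues of the protein sequence shown above.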